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PSYCHOLOGICAL CONSEQUENCES OF BRAIN 
LESIONS AND ABLATIONS 
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AND HAROLD WILENSKY 
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In a previous paper, the senior 
author (164) reviewed the literature 
dealing with the psychological conse- 
quences of organic brain lesions and 
ablations through the year 1941. The 
intervening period of over ten years 
has witnessed considerable research 
in this area. In addition to the need 
to scrutinize and systematize the new 
and extensive literature relative to 
the problem of therapeutic psycho- 
surgery, it is also important to ob- 
serve significant trends in the de- 
velopment of new psychological 
evaluative techniques. Also, there 
have been notable modifications in 
the use and application of existing 
test methods in organic brain disease. 

In the previous review of the 
literature by Klebanoff (164), effort 
was made to formulate certain basic 
- generalizations. It appears appropri- 
ate now to examine critically these 
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conclusions in the light of the exten- 
sive research of the last decade. One 
is impressed by the striking develop- 
ment of new specialized test tech- 
niques. It seems important to eval- 
uate these new techniques in terms 
of their potential clinical value. Re- 
search of the last decade has stressed 
the potential importance of such 
variables as the patient’s age and 
the quality of interaction with the 
environment in producing differences 
in deficit in patients with apparently 
similar disease processes. 

In the light of all the above con- 
siderations, the present review has 
been undertaken. The general atti- 
tude underlying this survey is one of 
critical appraisal rather than mere 
review; the basic objectives are to 
organize, integrate, clarify, and ex- 
tract significant generalizations. Such 
an approach appears to be indicated 
if the basic conclusions in the area of 
psychological testing in organic brain 
disease and psychosurgery are to be 
evaluated in terms of essential con- 
tribution to scientific and clinical 
knowledge. 


BRAIN INJURY IN CHILDREN 


Psychological studies of persons 
who have suffered congenital brain 
injury, encephalitis, or other types of 
cerebral damage early in life have 
proven an increasingly fertile area of 
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research. The significance of research 
with brain-injured children, in addi- 
tion to its practical value for re- 
habilitation, lies in the possibilities of 
obtaining valuable understanding of 
the developmental aspects of brain 
function or dysfunction. Theories of 
normal mental development (135, 
180, 286) have stressed the fact that 
psychological functioning and cere- 
bral organization show an increase in 
differentiation and integration with 
age through adolescence. Although 
the less differentiated structure in 
the child leads to a more diffuse effect 
of injury if it occurs early, it also 
makes possible maturation with com- 
pensatory development of function. 
This negates the influence of an in- 
jury which would frequently have 
serious long-term sequelae in adults. 
Probst (222) followed up a group 
of 95 children with skull fractures 
occurring between ages one and fif- 
teen. After seven to fourteen years 
there was no evidence of impairment 
in school or vocational achievement, 
social adjustment, or intellectual de- 


velopment. Several investigators (50, - 


64, 84, 126, 163, 297) obtained varied 
results in brief case studies of func- 
tions disturbed by brain injury in 
rather young children. French (84) 
studied ten children who were oper- 
ated upon for cerebellar tumors with 
an extensive battery of generalized 
intelligence and specialized tests and 
found no evidence of impairment 
on deterioration-sensitive tests. Some 
children showed superior perform- 
ance. On the other hand, Cotton 
(64), employed a control group in 
studying the thought processes of 
spastic children and found striking 
differences between them and the 
normal control group. It is possible 
that ‘differences in severity of injury 
may well account for these divergent 
findings. An unusual case study 


described by Klapper and Werner 
(163) compared three children with 
cerebral palsy with their normal 
identical twins, using a large battery 
of tests sampling a wide variety of 
functions. The brain-injured chil- 
dren were impaired in every area 
when compared with their normal 
twins. 

The major portion of research o 
brain injury in children has emerged 
in the extensive work on the distinc- 
tion between endogenous and exog- 
enous feeblemindedness. This dis- 
tinction has proven of great heuristic 
value in stimulating an intensi 
study of the various dysfunctio 
consequent upon brain injury. As 
result of this research, it has been 
possible to develop fairly adequate: 
criteria for distinguishing these two 
types of defective children (32, 176, 
247, 256). This has resulted in the 
addition of a number of fairly sensi- 
tive and important techniques to the 
repertory of the clinician for measur- 
ing basic functions likely to be im- 
paired in early brain injury. 


Motor Functions 


The brain-injured child is generally 
subject to motor disturbances, since 
either cerebellar injuries or disturb- 
ances in the motor cortex are com- 
mon (32, 176, 247). Even when ob- 
vious motor disturbance is not pres- 
ent, the brain-injured child shows 
awkwardness and incoordination in 
performance. Heath (130, 131), com- 
paring exogenous and endogenous 
feebleminded children of comparable 
mental age in their performance on a 
simple rail-walking test, found that 
the exogenous children proved strik- 
ingly inferior. Descriptive studies of 
the motor performance of these 
exogenous  feebleminded charac- 
terize them as hyperkinetic, overac- 
tive, restless, impulsive, and inco- 

F 


BRAIN LESIONS AND ABLATIONS 


ordinated (247, 283). The impression 
gained from such descriptive studies 
suggests that motor difficulties are as 
much a function of fairly subtle dis- 
turbances in the integration of be- 
havior and the inhibition of overt re- 
sponsiveness as they are of specific 
damage to motor areas of the cortex. 
The use of the Van Der Lugt Scale 
(270) seems to offer potential for 
major research in the area of motor 
development and defect in the future. 


Perceptual F: unctions 


The tuost extensive and definitive 
studies of brain-injured children have 
been in iue area of perception. This 
work has been of theoretical as well 
as of practical importance since it has 
demonstrated striking relations be- 
tween perceptual and motor func- 
tioning that has led to suggestions for 
a sensory-tonic theory of perception 
(248, 282). Thus, Strauss (255), in 
summarizing three case histories, de- 
scribed perceptual and thought proc- 
cesses which mirror the forced re- 
sponsiveness, hyperkinesis, and in- 
coordination of the brain-injured 
child in his motor performance. 
Bender and Silver (33) describe the 
case of a brain-injured child with a 
severe modification in perception of 
the body. They conclude that the 
body image of the brain-damaged 
child is disturbed by “tonus pulls," 
equilibrium problems, and perceptual 
and integrative difficulties which 
heighten his social inadequacy. 

Werner and Strauss (292) studied 
normal children equated for mental 
age with endogenous and exogenous 
mentally defective children without 
gross motor or central visual defects. 
Phey used tachistoscopic presenta- 
tion of marred figures and copying of 
Patterns of marble boards or repro- 
duction of the patterns by drawing. 

Ithough the endogenous children 


T 


were genera a Fi the normal 
po 
réspónse 


group, ered strikingly in 
mode of from the exogenous 
children. ‘The latter made a great 


many errors that suggested failure to 
discriminate between figure and 
ground. 

Subsequent reinforcement of these 
findings has come from the investiga- 
tions of Lord and Wood (186) and 
Dolphin and Cruickshank (70). The 
latter studied perception of embedded 
figures and mosaic reproduction by 
matched normal and cerebral-palsied 
children of low-average intelligence. 
They attributed the significant in- 
feriority in perceptual performance of 
the children with cerebral palsy to 
forced responsiveness and meticulos- 
ity. Bensberg (42), in a careful study, 
compared familial mental defectives 
and brain-injured defectives using 
the marble-board procedure. The 
brain-injured group proved signifi- 
cantly inferior in accuracy and tended 
more often to show "jumps" to new 
lines in their drawings. The results 
continue to point to pathological 
figure-ground perception as a basic 
difficulty consequent on brain injury. 
Werner (283), studying the per- 
formance of the two feebleminded 
types on the Rorschach, noted sig- 
nificantly more white space responses 
(S) for the exogenous group. Case 
studies by Strauss and Werner (257, 
258), and Werner and Carrison (289), 
demonstrate ingeniously the inter- 
relationships of defective figure- 
ground formations in brain-injured 
children and other symptoms such as 
finger agnosia and arithmetic dis- 
ability. 

Similarly, these brain-injured chil- 
dren showed inability to grasp and 
retain visual patterns made up of dis- 
crete elements. The authors at- 
tributed this to a basic difficulty in 
integrating several aspects of a visual 
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stimulus. This is a defect similar to 
that reported for a brain-injured 
adult by Nichols and Hunt (205) and 
considered a basic consequence of 
severe brain pathology by Hunt and 
Cofer (145) in their review of psy- 
chological deficit. A case study by 
Schilder (243) has similar possibilities 
for interpretation of this deficit. 
Brief mention may be made of ad- 
ditional perceptual difficulties mani- 
fested by  brain-injured children. 
Werner and Thuma (295) compared 
equated groups of feebleminded rep- 
resenting the two etiological types 
'on performance in critical flicker 
fusion perception. Results showed 
the exogenous feebleminded fell sig- 
nificantly below the familial group 
in critical flicker frequency at all 
three brightness levels. This finding 
is quite important in view of exten- 
sive research on adults reported by 
Teuber (263) and Halstead (120). 
Similarly, a marked defect in the per- 
ception of apparent or illusory mo- 
tion, the phi phenomenon, and tachis- 
toscopic presentation of stick figures 
in imbalance was found among the 
exogenous feebleminded by Werner 
and Thuma (294). These children 
also showed fewer movement per- 
cepts on the Rorschach than an 
equated group of endogenous feeble- 
minded. In one of the few studies in- 
volving auditory perception in brain 
pathology, Werner and Bowers (288) 
found that the brain-injured feeble- 
minded were deficient in discriminat- 
ing melodic patterns, revealing much 
the same disturbance in figure-ground 
pathology noted above. 


Thought Processes 


Sorting tests have proven increas- 
ingly useful in studying thought in 
process and in its effect. Strauss and 
Werner (259) employed the Halstead 
sorting test to determine if the find- 


ings with brain-injured adults were 
comparable to those obtained wi 
brain-injured children. Twenty pairs 
of exogenous and endogenous feeble- 
minded children were matched for 
mental age and intelligence. The 
brain-injured children formed more 
relationships between objects in the 
voluntary sorting but accomplished: 
this by forming "singular and un- 
usual combinations." In choosing: 
objects to go with rather unambigue 
ous pictures, the brain-injured chil- 
dren also selected uncommon ob 
jects. They deviated more from 
standard names, organized objects 
into circumscribed units, overstressec 
orderliness, and showed extreme cone 
creteness. In contrast with the endog- 
enous children and particularly with 
a group of healthy children, the: 
brain-injured children were attracted 
to “properties of objects apt to elicit 
motor responses.” The latter finding 
seems important in view of the hy- 
perkinetic tendencies ascribed to 
brain-injured children. 

In a further study of thought 
processes, Werner and Carrison (290) 
employed a standard procedure for 
studying animism or the tendency to 
perceive natural events or objects as 
living. The results, based upon levels 
of thought development derived from 
the work of Piaget, showed signifi- 
cantly more animistic responses by 
the exogenous children. In keeping 
with certain concepts of Goldstein 
(93) and Werner (286), the animistic 
tendencies of brain-injured children 
may be looked upon as functions of 
a greater rigidity and concreteness 
which prevent them from detaching 
themselves from objects and events. 
They are thus unable to differentiate 
between their own feelings and those 
of the surrounding world. Evidence 
that these tendencies are a result ol 
impairment due to brain inj 
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rather than merely representing a 
particular type of intellectual limita- 
tion is forthcoming in a study by 
Cotton (64), who compared physi- 
cally normal children with spastic 
children of average intelligence. Re- 
sults on a series of sorting, verbal 
completion, and patterning tests 
showed a decided inferiority for the 
spastics whose responses were more 
stereotyped and who were less in- 
clusive in sortings and more given to 
highly personal associations and 
other signs of concreteness. These 
findings were particularly associated 
with speech difficulties and were 
found less often when the congenital 
injury was restricted to motor areas. 
The problem of the rigidity or per- 
severation of brain-injured or feeble- 
minded generally has been discussed 
theoretically by Lewin (180) and 
Kounin (173). The latter carried out 
several significant studies demon- 
strating differences in rigidity pat- 
terns as a function of chronological 
and mental age. It is not clear 
Whether the subjects used by Kounin 
were exogenous or endogenous feeble- 
minded, although apparently they 
were physically normal as deter- 
mined by medical examination. 
Werner (284), summarizing four ex- 
periments comparing rigidity pat- 
terns of brain-injured and familial 
feebleminded, observed more per- 
Severation in the former group. He 
characterized the rigidity of the 
brain-injured as abnormal; it involved 
a trend away from the global ap- 
Proach toward isolation of elements 
of a series which became self- 
Contained and were repeated mo- 
notonously or “jumped” suddenly 
zu the foreground. The familial 
minded, on the other hand, 
wed subnormal rigidity related to 

a relative lack of differentiation. 
hey showed predominantly global 


behavior, perception, or action, or- 
ganized as undifferentiated wholes so 
that situations not sharply set apart 
were easily fused, resulting in stereo- 
typed responses. 

Personality 


There have been comparatively 
few attempts at extensive personality 
evaluations of brain-injured children. 
Werner (283) studied the Rorschach 
test performance of brain-injured 
feebleminded and found that they 
showed a restriction of creative fan- 
tasy activity, strong tendencies to- 
ward explosive emotionality, nega- 
tivism, and the disintegrative be- 
havior described above. Colm (62), 
employing the Lowenfeld Mosaic 
Test, compared normal and brain- 
injured children. She described the 
brain-injured children as “stimulus 
bound,” with an inability to shift and 
a repetitiveness which indicated an 
impairment in abstraction. Thus, 
these children showed simple additive 
placement of the mosaic pieces using 
side-by-side or color-by-color pat- 
terns and showed difficulty in formu- 
lating and executing a plan for a de- 
sign. Colm indicated an important 
distinction between the exogenous 
children and compulsive or autistic 
children who also showed repetitious- 
ness. The latter perseverated chiefly 
on words with strong emotional con- 
tent while the brain-injured children 
repeated almost any phrase regard- 
less of the emotional tone. 

In conclusion, the extensive re- 
search on brain-injured children in- 
dicates defects that are generally like 
those described for brain-injured 
adults. The children show inability 
to develop abstract functions, patho- 
logical rigidity, lack of creative imagi- 
nation, stimulus bondage, and disin- 
tegrative behavior in all areas. Brain- 
injured children also show strikingly 
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unequal development of capacities, 
suggesting some cerebral localization 
of damage, but the basic defects in 
abstraction hinder any possibility of 
really integrated development for 
many. 


ORGANIC PSYCHOSES AND DE- 
GENERATIVE DISEASES OF 
THE NERVOUS SYSTEM 


Despite the increased interest in 
problems of the aged in the past dec- 
ade, there has been little extensive 
research dealing with psychological 
deficit or emotional changes con- 
comitant with senescence. A major 
trend in the past decade has been an 
awareness of some decline in intel- 
lectual functioning with increased 
age (151, 274). There has been little 
attempt, however, at precise analysis 
of the nature of this decline and its 
relationship to emotional stability 
and the psychological milieu. 

Fortunately, some studies on the 
organization of mental life in organic 
psychoses are beginning to appear. 
Botwinick and Birren (49), for ex- 
ample, compared the intellectual per- 
formance of hospitalized seniles with 
that of normal persons of the same 
age range (60 to 70 years) and with 
that of a matched group of young 
normals. They used the Wechsler- 
Bellevue and Babcock-Levy Revision 
to test for deterioration. These in- 
vestigators concluded that while dif- 
ferences between the two aged groups 
existed, the older persons were far 
more like each other than were the 
normal aged and the younger group. 

Of greater theoretical interest, per- 
haps, are studies attempting to de- 
lineate the pattern of mental change 
consequent on the fairly diffuse de- 
generative brain changes that seem to 
occur with cerebral arteriosclerosis 
and similar diseases. Thus, in studies 
comparing 100 male senile dements 


with an average age of 73, and em- 
ploying such tests as Raven's Pro: 
gressive Matrices, Eysenck (72, 73) 
found considerable deterioration im 
the patients in terms of healthy adult 
and child norms. A factor analysis 
of the results showed a general factor 
and three factors relating to speed, 
memory, and physical strength which 
could be identified fairly readily. 
Tests of abstract reasoning pro 
most sensitive to deterioration 
contrast to tests involving memory 
and stable knowledge. A differentia 
deterioration in abilities appeared 
occur along the lines suggested many 
years ago by Hughlings Jackson 
(149) in which the diffuse cerebral 
damage leads to initial loss of the 
most complex functions which de- 
velop most recently in the evolution: 
of the nervous system.” In this con 
nection, Halstead (115) studied senile 
dements over 70 with an extensive 
battery of psychological tests. There 
were two groups in different stages of 
dementia. Tests discriminating be- 
tween the two groups involved the 
more complex functions. The testsi 
cluded Porteus Mazes, Knox Cub es, 
Block Designs, and various recent 
memory tests. 1 
The impoverishment of function- 
ing in senile psychotics was brought 
out in a study by Cleveland anc 
Dysinger (57). These investigators 
studied the performance of institu- 
tionalized seniles, whose average a 
was 75, with the Wechsler-Belle 
scale and an object-sorting test. The 
patients manifested decided diffi 
culties in assuming an abstract att 
tude and could not sort on a com 
ceptual basis. That even verbal be 
havior suffers impairment in senile 
dementia was demonstrated by Feld 
man and Cameron (75) and Ackels- 
berg (3). The former investigator: 
compared the speech of senile psy 
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chotics with that of normal adults 
and children. Although based on 
rather small samples, this study does 
suggest that seniles have difficulty in 
dealing with abstract terms and rely 
most heavily on concrete nominal 
forms or words of action to the neg- 
lect of adjectives which involve im- 
aginative processes. Ackelsberg (3) 
obtained comparable findings in vo- 
cabulary studies of seniles. Recent 
studies by Pinkerton and Kelly (217) 
and Hall (113) contrasted deteri- 
orated seniles with children and de- 
pressives by the use of various sort- 
ing tests and supported the thesis 
that a loss of conceptual ability oc- 
curs early in the course of brain de- 
generation while more general in- 
tellectual test performances remain 
relatively intact. 

Diffuse degeneration of the cortex, 
particularly in the frontal areas, oc- 
curs in general paresis. The behavior 
of paretics in the classic description 
of the disease, while more flagrantly 
psychotic, does resemble that of the 
older senile dementia or arterio- 
sclerotic patients. With the improved 
methods for treating syphilis and ar- 
resting general paresis, a re-evalua- 
tion of the description of the disease 
may well be necessary. 

Psychological studies of patients 
with general paresis have been fairly 
frequent in the past decade and have 
provided suggestive findings. Studies 
with the Wechsler-Bellevue scale 
(88, 195) as well as with the CVS 
abridgement (198) have suggested 
the usefulness of these tests in de- 
roamed the intellectual performance 

; Paretics. While there were indi- 
vidual differences in the degree of im- 
Pairment on the various subtests, the 
gpoeral findings suggested lowered 
Es “aie ability, greater diffi- 

d 1n new learning, verbal compre- 

Sion, and concept formation in the 


case of the paretic group. Trist, 
Trist, and Brody (269) used a large 
battery of cognitive tests in compar- 
ing normals, neurotics, and paretics 
and observed that the paretics found 
it difficult to ignore irrelevant detail, 
and a flowing together of figure and 
ground was evident in their perform- 
ance. 

A somewhat novel approach to the 
psychological study of paresis is of- 
fered in studies by. Rashkis, Cush- 
man, and Landis (225) and Rashkis 
(224). The former study involved the 
administration of a word-sorting test 
to normal adults, normal children, 
general paretics, and schizophrenics 
equated for educational achievement. 
On the basis of an analysis of the 
sortings, only the normal adults were 
able to assume the abstract attitude. 
Children and schizophrenics func- 
tioned at the complex, ‘‘pseudo- 
conceptual" level, while the paretics 
were unable to reach even that de- 
gree of attainment. From the stand- 
point of volitional attitude, the 
schizophrenics more closely resem- 
bled the normal adults while the 
paretics resembled the children. 
Rashkis (224), employing more care- 
ful controls, amplified and extended 
these findings. Rashkis studied three 
groups consisting of schizophrenics, 
general paretics, and cerebral arterio- 
sclerotics. The schizophrenics proved 
somewhat better “coordinated” and 
more capable of accounting for their 
performance. The arteriosclerotics, 
although “uncoordinated,” also at- 
tempted to account for their per- 
formance, in contrast to the paretics 
who were “uncoordinated” and of- 
fered no excuse or apology. 

In one of the few attempts to study 
the personality characteristics of 
general paretics, Klebanoff (165) em- 
ployed the Rorschach inkblots to 
compare findings between a homo- 
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geneous group of paretics and a nor- 
mal control group matched for age 
and education. The paretics lacked 
sufficient intellectual drive to func- 
tion at an abstract level. They 
showed concrete mental activity 
which lacked accuracy, conformity, 
and "fundamental adaptivity to en- 
vironmental needs.” 

A promising and somewhat dif- 
ferent approach to this problem is 
suggested by the preliminary study 
of Wittenborn, Bell, and Lesser (300) 
who carried out a factor analysis of 
symptom ratings along various scales 
for deteriorated organics, hebephren- 
ics, and young patients with func- 
tional psychoses. Different symptom 
clusters emerged with only anxiety 
and paranoid factors coinciding for 
the groups. Deterioration was most 
marked among the organic patients. 

Research is just now beginning to 
appear in connection with presum- 
ably subcortical neurological degen- 
eration such as is found in multiple 
sclerosis. Although somewhat con- 
tradictory results have emerged with 
respect to possible mental deteriora- 
tion concomitant with multiple 
sclerosis (25, 53, 68), differences in 
the populations studied and in the 
duration of the disease may account 
for these discrepancies. Canter (53), 
for example, found definite evidence 
of deterioration among patients in 
the early phases of the disease when 
compared with both their own mili- 
tary induction test scores and a con- 
trol group of clinic employees. Diers 
and Brown (68) reported no evidence 
of decline, while Baldwin (25) re- 
ported loss of abstract ability in some 
patients and no apparent decline for 
many others. 

It is in the area of personality that 
a striking uniformity emerges in three 
separate studies. A brief report by 
Harrower (127) was based on com- 


parisons of Rorschach and Szondi 
findings for multiple sclerotics, con- 
trol groups of normals, emotionally 
unstable psychosomatic patients, and 
patients with poliomyelitis and Park- 
inson's disease. It was found that the 
multiple sclerotic patients differed 
from the other groups in showing 
least concern with bodily symptoms, 
extreme dependency, resignation, de- 
nial of conflicts, and a need to excite 
sympathy. This general similarity 
to the classical picture of the hysteric 
patient with belle indifference also 
emerged in Rorschach studies by 
Blatt and Hecht (48) and an inten- 
sive study by Baldwin (25) of family 
background, premorbid factors, and 
data from Minnesota Multiphasic 
Personality Inventory patterns. The 
general similarity of the multiple 
sclerotic and neurotic hysteric is par- 
ticularly striking in the early phases 
of the disease (25) and suggests the 
importance of a premorbid personal- 
ity factor. In view of similar findings 
with idiopathic epileptics (91), an im- 
portant area of research in the psy- 


chosomatic implications of neuro- 


logical disease appears to require ex- 
tensive exploration. 


BRAIN Tumor, SURGICAL ABLATION, 
AND CEREBRAL TRAUMA IN 
ADULTS 


Disturbances in Perception 


The majority of studies have been 
restricted to the field of visual percep- 
tion. For excellent reviews of mate- 
rial on the borderline of visual sensa- 
tion and perception, the reader is re- 
ferred to Bender and Teuber (40) 
and Teuber and Bender (267). Hal- 
stead (120) has ascribed important 
theoretical significance to the critical 
flicker fusion frequency as a result 
of factor analyses of the test per- 
formances of normal individuals. On 
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the basis of intercorrelations between 
the critical flicker frequency and per- 
formance and other techniques, Hal- 
stead (120) has tentatively defined a 
power factor (P) which indicates the 
kinetic capacity of the individual, 
not unlike the "central vigilance" 
described by Head (128) in his 
studies of aphasia. Halstead (117, 
119) has reported evidence that indi- 
viduals of equal IQ may differ mark- 
edly in the P factor, which is sensitive 
to disruption by the presence of rela- 
tively small lesions in the brain and 
under conditions of low-grade anoxia 
or fatigue. Since Halstead indicates 
that a lowered critical flicker fre- 
quency is one of the prime indicators 
of a depressed level of the P factor of 
biological intelligence, the value of 
this simple laboratory procedure for 
psychological evaluation of the con- 
Sequences of brain lesions is 
enhanced. eue 
Unfortunately, the findings of Hal- 
Stead have not been fully confirmed. 
Battersby (27, 28) and Battersby, 
Bender, and Teuber (29) obtained 
negative results in studies of veteran 
patients with frontal lobe head 
Wounds. These patients failed to 
show any significant differences in 
critical flicker frequency level when 
Compared with an equated control 
group. On the other hand, a series of 
Studies (28, 29, 38, 40, 265, 266, 303) 
as shown definite lowering in critical 
E frequency after occipital lobe 
cl * This impairment was de- 
ius “a areas that appeared normal 
tos outine perimetric examina- 
n. Bender and Teuber (38, 40) 
ve reported that defects emerged 
us Eu wenn ecc flicker 
pue 8 tested in the periphe 
p Vision rather than in ihe maila 
Eu. "These results, while confirm- 
b E of Halstead (120) in raising 
concerning the significance of 


an anatomic point-to-point projec- 
tion in visual structures, are difficult 
to reconcile with his observations on 
frontal lobe patients. The differences 
may be a function of procedural vari- 
ations or, as Battersby (28) has sug- 
gested, the result of differing patient 
groups. 

Case studies comprise the bulk of 
the studies on perceptual response 
following brain injury. Rather chal- 
lenging for general theories of percep- 
tion are observations of a phenome- 
non tentatively labelled "extinction," 
in which patients confronted with an 
object in their left-half field report 
that it disappears as soon as another 
object is shown in their right-half 
field (34, 36, 303). As Hebb (136) has 
noted, these phenomena are difficult 
to assimilate into current perceptual 
theories. Bender and Furlow (34) de- 
scribe the manner in which tendencies 
to reorganize a psychological field of 
vision about a subjective center make 
it difficult for a patient with occipital 
lobe damage to recognize that he has 
lost central vision. i 

Problems of unusual spatial reor- 
ganizations brought out only by spe- 
cial testing devices are described in a 
number of cases of occipital injury. 
Paterson and Zangwill (211) ob- 
served in two cases of right parieto- 
occipital lobe injuries a tendency to 
overestimate the distance of very 
near objects and to underestimate the 
distances of far objects, although the 
ability to appreciate depth and dis- 
tance and the "implicit awareness of 
space" were not affected. Analysis of 
spatial structure was difficult, and 
various spatial relations tests and the 
Kohs Blocks brought out the defects 
in the patient. Drawings of complex 
objects were carried out piecemeal 
with poor articulation of subwholes. 
Temporal disorientation, also found 
by Coheen (58) with a larger number 
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of patients and various control 
groups, was also present. Since most 
case studies in this group represent 
traumatic injuries, a report by Sten- 
gel (254) of consequences of vascular 
lesions with an eclampsia in a 40- 
year-old woman is interesting since 
quite severe spatial reorganization 
occurred. Extensive testing revealed 
that for this subject the complex spa- 
tial organization of the outside world 
had been replaced by a most primi- 
tive organization in which “nearness” 
served as the only measuring scale. 

A series of cases of parietal and oc- 
cipital lobe injuries reported by 
Bender and Teuber (38, 40) is par- 
ticularly valuable because of the 
careful attempts by the authors to 
relate findings to a general theory of 
brain function. On the basis of an 
intensive sampling of increasingly 
complex perceptual performances, 
the authors report such defects as 
disturbance (limited to the lower left 
quadrant for some patients) in local- 
izing objects, teleopsia with a ten- 
dency toward an excess of depth in 
the subjective field, higher threshold 
for phi, lowered critical flicker fre- 
quency, and tendency to enlarge un- 
familiar objects. For some patients 
the subjective coronal plane was ro- 
tated as a whole, away from them on 
the left, toward them on the right. 
This shifting could be compensated 
for by one patient in daily life, while 
for another, subjective visual space 
no longer coincided with his tactile or 
locomotor space and led him into 
many constant errors in pointing. 
The authors feel that these results 
argue against associational theories 
and in favor of “vector” theories gen- 
erally related to Kóhler's (170) views 
of brain function. 

In an experimental attempt to test 
certain. hypotheses concerning cere- 
bral function related to Kóhler's the- 


ories, Klein and Krech (166) am 
Jaffe (150) independently carried out: 
recent studies of the kinesthetic fig- 
ural aftereffects in matched brait 
injured and control groups. Jaffe, 
using patients with traumatic head 
wounds, found no differences be 
tween controls and patients. Klein 
and Krech tested the hypothesis th 
both concreteness and figural after- 
effects in the brain-injured were func- 
tions of reduced “cortical conductiv 
ity” (174). They predicted more 
pronounced figural aftereffects that 
would appear sooner and last longer 
for subjects of varied degrees of brain 
pathology than for equated controls. 
Results supported the hypotheses 
and, in addition, high positive cor- 
relations were found between neurol- 
ogists’ ratings of extent of damage 
and the vividness and duration of the 
effect. The differences in results of 
the two studies may be in part a 
function of different patient popula- 
tions, traumatic head wounds as 
against surgical ablations, a persist- 
ing obstacle to the reconciliation of 
data in this field. 

There has as yet been little effort 
to relate perceptual defects in brain- 
injured patients to more general 
areas of functioning such as motiva- 
tion and personality. Critchley (66) 
has stressed the relation of body 
image to alteration in cerebral func- 
tioning following parietal lesions. 
Halstead (118) has shown a possible 
relationship of lowered and fairly 
rigid critical flicker frequency and 
poor performance on the Dynamic 
Visual Field test to severe judg- 
mental defects in a man following 
unilateral lobectomy. More recently, 
Weinstein and Kahn (278, 279, 280) 
have investigated motivational and 
interpersonal aspects of the per 
ceptual response of 50 patients with 
bilateral lesions of neoplastic or vas- 
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cular origin. These patients showed 
distortions not only in body image, 
but in self-concepts and manifested 
tendencies to deny illness or impair- 
ment and to misperceive stimuli re- 
lated to their disability or hospitali- 
zation. The denial of affective in- 
volvement sometimes reported fol- 
lowing lobotomy may represent a 
similar phenomenon and quite a seri- 
ous threat to future adjustment 
since it substitutes an equally un- 
realistic self-orientation for the previ- 
ous “tortured self-concern.”” 
Summarizing results in psycho- 
logical studies of perceptual function- 
ing following brain lesions and abla- 
tions, it is apparent that the move- 
ment toward specialized testing 
techniques pointed out by Klebanoff 
(164) has progressed rapidly. Per- 
ceptual research with brain-injured 
patients suggests that the repertory 
of the clinical psychologist in the ex- 
amination of neurological patients 
may soon be augmented by the use 
of specialized laboratory methods 
such as tachistoscopic presentation, 
critical flicker fusion frequency test- 
ing, apparent movement thresholds, 
and other so-called ‘brass instru- 
ment" techniques. The value of these 
perceptual techniques in localization 
of lesions remains equivocal. It seems 
clear, nevertheless, that subtle per- 
ceptual impairments or reorganiza- 
tions following cerebral injury are 
most clearly brought out by intensive 
laboratory examination rather than 
by the widely used global clinical 
methods like the Rorschach, Bender- 
Gestalt, Wechsler-Bellevue, Hunt- 
Minnesota, etc. These laboratory 
techniques or those outlined by 
Goldstein (95) are now chiefly avail- 
able in only a few research centers 
(120, 263, 307). Before long, with the 
aid of these new techniques, the clini- 
Cal psychologist may be in a position 


to play an ever more vital role in a 
neurological setting. 


Memory and Attention 


The past decade has provided com- 
paratively little intensive research on 
specific memory or attention disturb- 
ances in brain-injured patients. The 
emphasis, rather, has been on incor- 
poration of these functions within 
broader categories such as perception 
or general intelligence. Certainly, 
many disturbances of immediate 
memory may actually reflect difficul- 
ties in attention to presented mate- 
rial. The distractibility and stimulus 
bondage which are reported to bechar- 
acteristic of both brain-injured adults 
and children (93, 94, 291, 293) would 
naturally impede original learning of 
material. Hall and Crookes (114) 
compared a heterogeneous brain-in- 
jured group with normals and schizo- 
phrenics in learning ability and found 
some qualitative suggestion of im- 
pairment in the organic patients, al- 
though differences between the or- 
ganic patients and the schizophrenic 
patients were not definitive. Ruesch 
and Moore (234) studied 190 patients 
immediately following head injury 
and reported that the serial sub- 
traction test ("100-7") proved most 
sensitive, suggesting that the ability 
to maintain sustained effort is par- 
ticularly sensitive to cerebral dys- 
function, whether momentary or per- 
sistent. Similarly, other investigators 
have found evidence of memory loss 
dependent largely upon failures in 
attention and immediate retentive- 
ness (4, 10, 20). Tests such as Hal- 
stead's Formboard Retention and 
Dynamic Visual Field (120), Teu- 
ber's Field of Search (263), and Ben- 
ton's Visual Retention (43) have 
revealed fairly clear evidence of at- 
tention and immediate memory de- 
fects in cases of brain trauma or 
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surgical ablation. Definitive stand- 
ardization of these procedures in the 
future should prove them valuable 
clinical aids. The major problem of 
devising laboratory and clinical pro- 
cedures capable of discriminating 
between certain schizophrenic pa- 
tients and individuals with various 
kinds of brain pathology remains a 
difficult methodological considera- 
tion. 

Mention should be made of some 
tests designed to detect brain dam- 
age by memory techniques (69, 102, 
144). Hunt (144) has developed an 
extensive battery employing as a 
base the 1937 Stanford-Binet with 
verbal and nonverbal learning and 
recall tests. The test was developed 
on patients with diffuse brain injury, 
but subsequent research (4, 20, 154, 
196) suggests that the test does not 
discriminate well. 

Employing an omnibus memory 
test approach, Graham and Kendall 
(102), while reporting significant dif- 
ferences between a brain-damaged 
group and a neurotic group, found 
impairment present in only half of 
the organics. In a careful study by 
Cohen (59) no measures of the 
Wechsler Memory scale discriminated 
between neurotics and patients with 
intracranial pathology. These nega- 
tive findings do not necessarily indi- 
cate that memory defects are uncom- 
mon among patients or that memory 
scales are without value in neurologi- 
cal testing. These scales have un- 
doubted usefulness as part of an indi- 
vidual testing battery in the hands 
of an experienced clinician. It is 
clear, however, that the complexity 
of brain functioning and the subtlety 
of psychological sequelae of organic 
brain impairment severely limit any 
research or clinical studies with om- 
nibus memory scales. 


General Intelligence 


Although reports on general intel- 
lectual consequences of brain injuries 
or diseases are considerable in quan- 
tity, the past decade has seen little 
progress in the treatment of this 
issue. Critical case studies (129, 132, 
202, 207) have suggested that general 
intelligence as measured by standard 
scales may be normal or actually im- 
prove following large but clean ex- 
cisions of previously pathological 
brain tissue. Concern in employing 
intelligence tests has been, with cer- 
tain exceptions, limited to the Wechs- 
ler-Bellevue Intelligence Scale and 
various measures of deficit deriv 
from this test. Recognizing that in- 
telligence in general may not show & 
change after brain lesion, investiga- 
tors, following the clinical suggestions 
of Wechsler (275), have attempted to 
tease out patterns of test perform- 
ance capable of differentiating the 
brain-injured or diseased from nor- 
mal individuals. Unfortunately, as 
Rabin and Guertin (223) have noted 
in their review of Wechsler-Bellevue 
research, the Mental Deterioration 
Index (MDI), originally developed 
on the basis of decline in scores with 
increasing age among normals, has 
been translated too hastily to indi- 
cate decline following organic brain 
damage. 

A brief report on clinical cases by 
Levi, Oppenheim, and Wechsler (178) 
indicated, for example, the practical 
usefulness of the MDI in differenti- 
ating patients with organic lesio 
from patients with hysterical an 
other psychogenic disturbances. Al 
though cautious in their generaliza 
tions, these authors did raise an im 
portant principle for use in studyin 
deterioration. The concept of tests 
that “hold up" with age and heni 
may conceivably be less resistant 
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the inroads of brain damage is a chal- 
lenging one. The question remains, 
however, as to the specific nature of 
"hold" and "don't hold" tests and 
to the quantitative relationships 
necessary to suggest pathological de- 
terioration. In its purely quantita- 
tive form, the MDI has not been 
shown to be sufficiently discriminat- 
ing in individual determinations of 
brain-injured and normals (8, 15, 47, 
110, 156). 

It should be noted, however, that 
the general principle of the contrast 
of "hold" and “don’t hold" tests 
seems to be operative and that the 
majority of brain-injured are prop- 
erly diagnosed by its use. For indi- 
vidual prediction, however, reliance 
on quantitative scores alone seems 
relatively futile. Anderson (15, 16) 
compared clinically substantiated 
cases of focal lesions in the dominant 
hemisphere with comparable cases of 
lesions in the nondominant hem- 
isphere. He found that while the 
MDI did not yield very reliable 
quantitative discrimination, differ- 
ences were in the expected direction. 
Brain-injured patients with focal le- 
sions of the dominant hemisphere 
were more easily picked up by the 
use of the index than were those 
with nondominant lesions. On the 
basis of several studies and a com- 
parison of brain-injured and brain- 
diseased patients, Allen (7, 8, 9, 10, 
11) suggested a modification of the 
index, but other studies have pro- 
Vided data which suggest that this 
modification is not significantly more 
discriminating than the original (4, 
47, 230). Other attempts at develop- 
ing deterioration indices (138, 227) 
have been evaluated by Gutman 
(110), who found that a rather com- 
Plex method developed by Hewson 
(138) agreed fairly well with clinical 
diagnoses of brain damage. 


Far more significant perhaps than 
the empirical derivation of complex 
indices (which are not very dis- 
criminating and have not been cross- 
validated by the investigators) is evi- 
dence that certain functions appear 
again and again in the list as particu- 
larly susceptible to impairment by 
brain injury. Almost all investigators 
have noted that Digit Symbol, repre- 
senting disciplined psychomotor 
learning, is most clearly affected. 
Also easily impaired by brain injury 
are Digit Span, a test of recent mem- 
ory or attention (7, 9, 227, 233, 234); 
Block Design, testing analytic and 
synthetic capacities (4, 7, 9, 92, 107, 
183); and Arithmetic (4), testing 
concentration and simple numerical 
facility. Since these subtests are most 
easily affected by anxiety or other 
emotional disturbances, a great deal 
of overlap between various clinical 
groups (223) is almost inevitable. 

Too much of the research in this 
field has been excessively test-bound 
and empirical, relying heavily upon 
omnibus scales of general intelligence. 
New concepts like Cattell’s ‘“‘crystal- 
lized” and “fluid” abilities (54), sim- 
iar views of Hebb (135) related to 
his physiological rationale, and Hal- 
stead’s four factors as components of 
“biological intelligence” (120) should 
provide useful theoretical;bases for 
more precise explorations. 

Halstead provided some data to 
suggest a differential influence of 
brain lesions on his four factors of 
biological intelligence, with the P 
factor considered the most sensitive 
and disturbed, particularly in pre- 
frontal lobe lesions, in his data. Hal- 
stead's results also suggest a gradient 
of impairment in biological intelli- 
gence with the frontal areas most 
sensitive. This hypothesis needs ex- 
tensive testing since somewhat con- 
tradictory data are available from 
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other studies (29, 40). Nevertheless, 
Halstead's work poses a challenge to 
clinical psychologists to reconsider 
their techniques and their excessive 
reliance upon omnibus tests or tech- 
niques standardized for other pur- 
poses. 


Disturbances in Thought and Lan- 
guage 


In the past decade, research has 
continued to suggest that a major 
consequence of brain lesions and ab- 
lations isa disturbance in thought and 
language, classified as a loss of ab- 
stract or conceptual ability in the 
viewpoint popularized by Goldstein 
(94, 100). The directions taken by 
research in this period have involved 
(a) more precise exploration of this 
basic defect by the use of refined 
procedures (100, 113, 120, 121, 141, 
264); and (b) attempts to ascertain 
whether the loss of abstract ability 
was particularly a consequence of 
frontal lobe pathology. 

Extensively studied cases of both 
frontal and diffuse brain pathology 
have been reported (2, 44, 118, 123, 
205, 306) in which patients mani- 
fested fairly general signs of impair- 
ment in abstract behavior as meas- 
ured by procedures like the Weigl 
color-form sorting, the object-sorting 
tests, the Shipley-Hartford Con- 
ceptual Quotient, the Kohs Blocks, 
etc. 

A striking and well-studied nega- 
tive case was presented by Hebb 
(134) in which the patient was op- 
erated upon for removal of frontal 
scar tissue that had led to near-psy- 
chotic behavior. This patient im- 
proved subsequent to a fairly clean 
but extensive bilateral frontal lobec- 
tomy. He showed no clear signs of 
unusual disturbance in abstract be- 
havior when presented with a host of 
psychological tests, many of which 


tapped conceptual processes. Differ- 
ences between Hebb's case and cases. 
discussed by Goldstein (98) may 
in the nature of the injury and 
possible persistence of pathological 
tissue in some instances. 

The value of various tests of ab- 
straction in clinical neurological test- 
ing has been brought out in a number 
of studies (95, 97, 108, 139, 307). 
Greenblatt, Levine, and Atwell (108) 
found that, in comparing patients 
with known heterogeneous types 
brain damage and patients without 
brain damage, abstraction tests like 
the Kohs Block, Weigl form-color, 
and Shipley-Hartford were extremely 
successful in differentiation, and, in 
combination with the somewhat less - 
accurate EEG, they could disc-im- 
inate almost perfectly between 
groups. The tests were used clin- 
ically and the specific contribution of 
each is not presented, unfortunately. 
Similarly, Hoedemaker and Murray 
(139) noted almost perfect discrim- 
ination of 16 brain-injured from 16 
schizophrenics and 16 neurotics by 
the use of a battery including the 
Wechsler-Bellevue, Rorschach, Szon- 
di, Ellis Visual Designs, and B.R.L. 
Sorting tests. Inspection of their 
breakdown of the various functions 
tapped reveals that thought process 
disturbance (tapped chiefly by the 
sorting tests) and memory were most 
consistently found and most ac- 
curate in differentiating. Electro- 
encephalograms were less accurate, 
but the test battery combined with 
EEG and routine neurological exam- 
ination led to perfect discrimination. 

Although originally devised as a 
nonverbal intelligence test, the Kohs 
Block test has proven a valuable and 
interesting technique in diagnosis of. 
brain-injured patients. ^ Wechsler 
(275) reported it to be one of the most 
useful subtests in his intelligence - 


BRAIN LESIONS AND ABLATIONS 15 


scale for this purpose, while almost all 
the investigators using the Wechsler- 
Bellevue scale with brain-injured pa- 
tients indicate severe impairment on 
the Block Design subtest (4, 7, 9, 92, 
107, 183). To the extent that a com- 
bination of analytic and synthetic 
capacities is demanded of the subject 
by this test, it may measure the pri- 
marily abstract components of gen- 
eral intelligence. Thus, a study by 
Lidz, Gay, and Tietze (183) has 
shown a significant difference in men- 
tal age scores obtained from Vocabu- 
lary tests and the Stanford-Binet and 
the mental age scores obtained from 
the Kohs Blocks. A comparable 
group of 15 schizophrenics showed no 
significant differences between the 
three types of tests. 

Significant attempts to devise more 
elaborate and subtle tests of con- 
ceptual or abstraction ability have 
been reported recently (103, 113, 
120, 121, 141, 241, 264). A basic 
need in this area is a far more ex- 
tensive study of the role of abstract 
processes in normal human behavior, 
and studies like those of Heidbreder 
(137) or Hanfmann (122) represent 
only tentative beginnings. 

A second major problem in the 
area of abstract processes following 
brain lesion has been the question of 
ocalization. In the studies sum- 
ei by Klebanoff (164) the pre- 
'ominant trend suggested that le- 
vd of the frontal areas in particu- 
Er 

. y Rylander 

cm Halstead (120) particularly have 
" Pported this view. The latter in- 
pcr presented a small number 
s demonstrating greater loss of 
in ract ability on the category test 

patients with frontal ablations 
piis in patients with lesions in other 
TAN Using a study of 147 cases of 

matic head injury, Halstead re- 


ported that the tendency these pa- 
tients showed for impairment, as 
compared with controls, suggests 
again the greater effect of frontal in- 
jury, since mechanical and autopsy 
studies (65, 140) indicate that frontal 
injury is most likely to follow any 
type of head trauma, Lacking neu- 
rotic controls for his head injury 
group, Halstead's results are of lim- 
ited generality, in view of the findings 
of Ross and Ross (231) which suggest 
difficulties in differentiating between 
these groups. The fact is that Hal- 
stead's patients with nonfrontal le- 
sions also obtained scores consider- 
ably below those of the controls. 
This suggests impairment which, in 
view of the small number of subjects, 
may not be, practically speaking, less 
significant than findings for the 
frontal lobe patients. Finally, the 
age differences of the groups indicate 
that the frontal lobe patients were, 
on the average, seven years older 
than nonfrontal patients, who, in 
turn, tended to be somewhat older 
than the controls, so that age differ- 
ences rather than location of lesion 
might account for the results. It 
would appear, therefore, that Hal- 
stead's views are in need of further 
validation. 

A series of well-executed studies 
(30, 264, 268) has attempted to test 
the hypothesis that abstract thinking 
and fairly complex visual functions 
are most readily impaired by frontal 
lesions. These authors studied three 
well-matched groups of veterans, pa- 
tients with anterior lobe traumatic 
lesions, patients with parieto-occipi- 
tal traumatic lesions, and patients 
with peripheral nerve injuries who 
served as nonbrain-injured controls. 
Lesions were localized by wound of 
entrance. From a varied series of 
complex visual tasks, including the 
Gottschaldt "hidden figures," the 
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Wisconsin Sorting test (103), a 
visual-choice reaction test, and an 
adaptation for humans of Maier's 
reasoning situation, these authors 
concluded that test performance of 
groups of brain-injured was signifi- 
cantly below that of control subjects, 
despite fairly equal motivation for 
solution. On some tasks, the occipital 
lobe patients were inferior to the 
frontal patients. The authors feel 
that these results suggest that com- 
plex visual performance involving 
aspects of abstraction is at least as 
difficult for patients with posterior 
lesions as for patients with frontal 
lobe lesions. 

While these results cannot be com- 
pared directly with those of Halstead 
(120) because of the difference in the 
nature of the pathology (trauma ver- 
sus tumor), they do point up the 
serious consequences of any type of 
brain injury, particularly when it is 
recalled that the patients, unlike 
most tumor patients, were tested four 
to seven years after injury. On the 
other hand, since both the studies of 
Halstead (120) and Teuber and Ben- 
der (268) lack any anatomical or 
pathological data definitely localiz- 
ing the injuries, the results have only 
limited value for theories of brain 
function. In view of the findings of 
Holbourn (140) and Courville (65), 
Halstead might well argue that the 
trauma of the impact of high velocity 
missiles in any area of the brain 
would most likely involve the frontal 
lobes in any case, thus questioning 
the conclusions of Teuber and Ben- 
der. 

An interesting contribution to the 
problem of localization comes from 
results presented by McFie and 
Piercy (192), who studied 74 pa- 
tients, largely tumor cases, with uni- 
lateral lesions. Employing the Weigl 
form-color sorting test, they found 


that patients with left-sided lesions 
proved significantly inferior to pa- 
tients with lesions located in the so- 
called nondominant right side, irre- 
spective of location in the frontal, 
parietal, or occipital lobes. This find- 
ing prevailed even when the aphasic 
patients were excluded. These results 
somewhat contradict those of Ander- 
son (16) whose studies with the 
Wechsler-Bellevue subtests showed 
opposite results for patients with 
dominant and nondominant hem- 
isphere lesions. Here the former pa- 
tients proved inferior in verbal tests 
and superior in performance tests. 
Difficulty in comparing these results, 
a persistent problem because of brief 
reporting, makes conclusions n:ces- 
sarily tentative. 


Personality Following Brain Lesions 
and Ablations ý 


Research on personality function- 
ing following brain lesions and abla- 
tions has been based chiefly upon the 
use of the Rorschach test. A major 
difficulty arises in conclusions drawn 
from the Rorschach since it is appar- 
ent that the critical factors leading 
to differentiation of organic patients 
from others derive not from general 
personality characteristics as much as 
from disturbances in thinking. and 
perception of the types described 
above. 

The practical usefulness of the 
Rorschach in studies of brain injury 
is emphasized in a number of studies. 
Aita, Reitan, and Ruth (5) compared 
60 patients with posttraumatic brain 
injury with 100 controls representing 
a heterogeneous group of hospital 
patients. While quantitative analysis 
of Rorschach records yielded no con- 
sistent picture, use of certain of 
Piotrowski’s qualitative signs (218), 
e.g., impotence, perplexity, repetition, 
and color-naming, in addition to. 
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other signs suggesting extreme con- 
creteness, catastrophic reactions, in- 
flexibility, and vagueness, proved 
helpful. Many neurotic-like signs, 
depression, anxiety, and hypochon- 
driasis were also found among the 
organics. Insufficient data are pre- 
sented for more definitive evaluation 
of the discriminability of these signs, 
but it is clear that disturbance in the 
capacity for abstract thinking mani- 
fested by rigidity and extreme con- 
creteness emerged primarily. 

Koff (168) similarly reports the 
usefulness of the Piotrowski (218) 
signs in differentiating postconcus- 
sion neurotics from patients with 
spinal tap evidence of brain involve- 
ment. Again, differentiation is based 
chiefly upon those aspects of Ror- 
schach performance which may well 
reflect thought and perceptual dis- 
turbancesas the primary impairment. 
Further evidence which supports the 
value of a related technique calling 
for subjects to draw impressions from 
the Rorschach cards comes in a series 
of papers (51, 104, 179). Using both 
the Rorschach and the Graphic 
Rorschach, Grassi (104) reports ten 
signs generally similar to those of 
Piotrowski which proved most dis- 
criminating between organics and 
other syndromes. A distinction be- 
tween blot and concept dominance in 
performance is made by these au- 
thors who point out the extent to 
which brain-injured patients are blot- 
dominated. 

The general trend of results sug- 
gests again primarily primitive think- 
ing and concreteness rather than a 
basic personality disturbance. Simi- 

r indication of a perceptual and 
prine impairment is suggested by 
he work of Hughes (143), who de- 
rived anew list of Rorschach signs 
to differentiate organics from other 
£roups of patients and normals by 


factor analysis. Another study by 
Ross and Ross (231) evaluated a 
number of different clusters of signs 
and, after extensive manipulation, 
developed a fairly complex scoring 
system of “instability” and ''disabil- 
ity” ratings that differentiated nor- 
mals from neurotics and brain- 
damaged patients. 

The “sign” approach, as seen in 
these studies, seems to work fairly 
well with brain-injured patients, but 
extensive cross-validation seems es- 
sential. There has been little effort 
to establish the reliability of various 
"signs." In general, qualitative 
analyses of the Rorschach protocols 
seem to be most successful in select- 
ing brain-injured patients. The sign 
approach, excessively empirical, gives 
little real feeling for the basic per- 
sonality factors that emerge as a con- 
sequence of brain injury. It should be 
noted that signs described, such as 
impotence, perplexity, inflexibility, 
etc. are not intrinsic, specifically 
called forth by the Rorschach blots. 
They merely represent behavioral evi- 
dences of slowed reaction times, ab- 
normal concreteness, perceptual dis- 
turbances, and awareness of impaired 
functioning. It seems that the Ror- 
schach has thus far offered little 
that is new toward our understanding 
of the personality of the brain- 
damaged. However, the value of in- 
tensive Rorschach studies of indi- 
viduals with brain damage is brought 
out by several case studies (2, 71, 
207, 306). 

Somewhat unique in the field of 
personality studies of the brain-in- 
jured is a research by Anderson and 
Honvik (17) comparing Minnesota 
Multiphasic Personality Inventory 
profiles of patients with frontal lobe 
lesions and with parietal involve- 
ment. The frontal lobe patients ap- 
proximated the clinical picture of the 
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"hysteriod reaction type" while the 
parietal patients more closely re- 
sembled that of the "anxiety neuro- 
sis." Since this study lacked any 
normal or neurotic control groups and 
since no estimate of the degree of 
overlap is presented, conclusions re- 
main tentative. 

Summarizing the results of the 
meager personality studies of brain- 
injured patients, it would appear 
that in general these patients mani- 
fest abnormal concreteness, diffuse 
and overly generalized modes of or- 
ganizing their experiences, uncon- 
trollable emotional outbursts, diffuse 
anxiety ("catastrophic reactions"), 
and a profound sense of personal in- 
adequacy (1, 5, 104, 148, 155, 179, 
306). Lacking in the studies of the 
personality of the brain-injured has 
been any attempt to consider the 
personality dynamics and the ade- 
quacy of interpersonal relationships 
following brain damage of various 
types. Comparatively little effort at 
systematically observing social inter- 
action patterns of the brain-injured 
has been reported, although some 
authors (6, 94, 203, 278) have re- 
ferred to problems of this sort. 


Psychological Concomitants of Epi- 
lepsy 

Psychological studies of patients 
with convulsive seizures diagnosed as 
epileptics have sought generally to 
answer two questions: Is there evi- 
dence of psychological deficit or de- 
terioration in patients with epileptic 
seizures? Are there distinctive per- 
sonality types or patterns of traits 
characteristic of epileptic patients? 
Since epilepsy has generally been 
considered a physiochemical disease 
(177), treatment has been largely by 
chemical means. For many years, it 
was felt that the chemical disturb- 
ance in the brain led necessarily to 


mental deterioration. By 1943, su 
ficient data had appeared in a variety: 
of studies to lead Lenox (177) to con- 
clude in his review that mental de 
terioration was by no means a con 
comitant of idiopathic epilepsy. A 
number of studies employing psycho- 
metric intelligence tests, usually the 
Stanford-Binet, failed to find any 
signs of deterioration (18, 74, : 
252). 

Concern in the past decade has 
centered not only upon evidence of 
deterioration but upon exploring: 
characteristic performance patte 
of idiopathic epileptics with standard 
intelligence scales. Collins and Lenox: 
(61) studied a large, heterogeneous 
group of epileptic outpatients with. 
the Wechsler-Bellevue and found 
that the group was of better than 
average intelligence, probably the: 
result of a socioeconomic selective 
factor in the group studied. A some- 
what better controlled study by 
Sands and Price (239) compared 
Wechsler-Bellevue patterns of idio- 
pathic epileptics with and without 
“personality problems." The groups. 
showed average intelligence with only | 
slight differences in their patterns. 

Somewhat more definitive findings. 
based on better controlled studies. 
have been reported by Goldman (91) 
and Winfield (298). The former com- 
pared Wechsler-Bellevue patterns of 
equated groups of idiopathic epilep- 
tics, patients with hysterical seizures, 
and patients with brain lesions but 
without seizures. In general, the 
epileptic patients and _neurotics 
showed a similar pattern and both 
groups differed strikingly from the. 
patients with organic damage, whose 
performance was generally impaired 
in the areas of concentration and ab- 
stract thinking. No impairment in 
intellectual functioning was observ 
in the idiopathic epileptics. 
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In a similar study Winfield (298) 
compared intellectual performances 
of well-equated groups of idiopathic 
("cryptogenic") epileptics, sympto- 
matic epileptics with known lesions, 
posttraumatic brain-injured patients 
without seizures, and normal con- 
trols. Where possible, school IQ or 
military test scores were used to in- 
sure matching of premorbid intelli- 
gence levels. Brief tests of verbal 
meaning, spatial relations, reasoning, 
associate learning, and abstraction 
were employed. Results indicated no 
significant differences between the 
controls and the idiopathic epileptics, 
while the two groups with known 
brain damage showed significantly 
lower scores in all areas. As in the 
study of Goldman (91), the seizure 
symptom seemed less significant than 
the factor of structural brain damage 
in relation to intellectual impair- 
ment. 

In general, results of recent studies 
of the intellectual performance of 
idiopathic epileptics do not provide 
any evidence of intellectual deteriora- 
tion or of any clear-cut character- 
istic pattern of performance unique 
to these patients. The importance 
of distinguishing between the clearly 
idiopathic or cryptogenic epileptics 
and those who show seizures follow- 
ing traumatic head injury or tumor 
has been indicated, since in the lat- 
ter group intellectual impairment 
does seem to occur. 

A more complex problem in the 
psychological study of epilepsy has 
been the attempt to delineate a per- 
Sonality pattern characteristic of 
epileptic patients. Psychological re- 
Search in this area has largely been 
restricted to Rorschach studies, fol- 
ring the original descriptions by 

orschach of a number of signs ob- 
Served in epileptics. ^ Rorschach's 
Signs suggest a general lowered 


mental functioning level, poor emo- 
tional control, and difficulty in ab- 
straction. These findings, partially 
confirmed by Guirdham (109), may, 
however, be a function of the use of 
heterogeneous, institutionalized epi- 
leptics, many of whom may have 
been symptomatic cases. Rorschach 
studies (19, 125, 169) revealed no 
peculiarly epileptic patterning, al- 
though tendencies toward poor emo- 
tional control were frequently ob- 
served. Lisansky (185), in a com- 
parison of idiopathic epileptics and 
diabetics, while yielding no clear-cut 
pattern, pointed to slower respon- 
sivity and a greater evidence of 
“neurotic signs” among the epileptic 
population. The findings of these 
studies agree chiefly in the long re- 
sponse time observed, poor form 
quality, emotional constriction, and 
poor emotional control. In view of 
the heterogeneous nature of the 
epileptic samples, however, the find- 
ings, which strongly resemble those 
for brain-injured patients on the 
Rorschach, are difficult to interpret. 

The study by Goldman (91) is one 
of the few employing well-matched 
groups. Goldman found no clearly 
defined personality pattern for the 
idiopathic epileptics. They did, how- 
ever, show quantitative and qualita- 
tive manifestations in the Rorschach 
of greater drive for achievement, 
poorly controlled emotionality, im- 
maturity, lowered "inner control and 
poise,” inward turning of affect, and 
sexual disturbance. In general, the 
epileptics resembled the hysterical 
seizure patients more closely than 
the brain tumor patients in these per- 
sonality characteristics. These re- 
sults, while somewhat limited because 
of the absence of normal controls, 
point to a strong emotional involve- 
ment in epileptics. 
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PsyYCHOSURGERY 


Attempts to determine the func- 
tions of the frontal lobes in man, 
based upon studies of individuals 
with brain pathology, have been 
hampered by the lack of evaluations 
of premorbid behavior, the uncon- 
trolled destruction of brain tissue, 
and the frequent difficulty in localiza- 
tion of areas affected. The various 
psychosurgical procedures developed 
may avoid these problems to some 
extent, but not without many seri- 
ously limiting complications. While 
individuals with presumably ana- 
tomically intact brains are subjected 
to relatively deliberate brain damage, 
these persons necessarily are in the 
midst of great emotional upheaval 
whether they suffer from excruciating 
pain, severe neuroses, or psychoses. 
In many cases, preoperative meas- 
ures of specific functions are not ob- 
tainable. Limiting the population to 
accessible patients introduces a samp- 
ling bias of unknown direction and 
degree. When examination is pos- 
sible, the reliability of the obtained 
data, particularly with psychotic 
patients, is open to question. The 
assessment of postoperative impair- 
ment of changes in functioning is 
additionally hampered by the fact 
that many of these persons were im- 
paired intellectually as a result of 
emotional factors prior to the in- 
troduction of a brain lesion. 

Essentially, frontal lobe tissue can 
be rendered nonfunctioning by three 
methods: (a) direct attack on the 
thalamus, a technique called thala- 
motomy or Sstereoencephalotomy; 
(b) cutting into the white matter and 
destroying varying numbers of fibers 
connecting the cortex and the thala- 
mus—lobotomy, leucotomy, cortical 
undercutting, etc.; (c) excision or 
destruction of the cortex—prefrontal 


lobectomy (215), topectomy (199) 
gyrectomy (212), and thermocoagula 
tion and venous ligation (200). T 
diverse techniques have been 
veloped in attempts to improve upal 
surgical procedures and to avoi 
some of the undesirable personality 
changes accompanying the 
niques as employed initially. Surgica 
precision, however, appears to lag fa 
behind the specificity of the hy 
potheses underlying these variation 
In postmortem studies Meyer à 
McLardy (201) reported great v2 
ability in the anatomical lesions pf 
duced in lobotomized patients m 
gardless of the intentions of 
neurosurgeon. In direct attacks up 
the cortex, accuracy generally is m 
increased because of individual diff 
ences in human brains and the 
of clear differentiation among areas. 
Even in thalamotomy, which im 
theory appears to be a most preci 
method, additional damage may re 
sult from the gases generated during 
electrolysis and interference wil 
blood vessels (200). The larg 
variety of psychosurgical techniq 
plus the possibility of consideral 
variation in the extent of the lesions 
produced by means of any one ope! 
tive procedure contribute to the 
difficulty of evaluating experime 
findings in terms of altered functio 
Many uncontrolled factors 
also at play after surgery which 
scure and confound changes in fu 
tioning. Social improvement or @ 
least increased cooperativeness is 4 
frequently reported concom 
along with altered attitudes of cli 
cal personnel and relatives. Attem 
to organize the many and apparent 
contradictory findings of various in 
vestigations are difficult because 0 
inadequate experimental desig! 
Samples are generally small; 
quate control subjects are rà 
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available; the time of testing, both 
pre- and postoperatively, varies con- 
siderably within as well as among 
studies; practice effects are generally 
ignored; and statistical evaluation of 
data is frequently omitted or in- 
adequate. In an attempt to bring 
some structure into this confusion, 
the present review will deal as much 
as possible with data from specific 
techniques rather than with the in- 
terpretations made by the various 
authors. 


Standardized Intellectual Tests 


Table 1 presents a summary of the 
data in a number of studies in which 
standardized intellectual tests (Stan- 
ford-Binet and Wechsler-Bellevue) 
were administered pre- and post- 
operatively with quantitative data 
reported. Examination of the post- 
operative changes reveals that most 
of these studies show a decrement in 
IQ for operated patients which in 
several instances is quite large or 
statistically significant (172, 197, 
214, 238, 304). Nonsignificant 
Changes are reported in the others 
(191, 199, 200, 220, 262, 272). 

Before evaluating these findings, 
practice effects must be assessed. In 
the first Columbia-Greystone report 
(199), Form I of the Wechsler- 
Bellevue was administered once pre- 
operatively and twice postopera- 
tively. In the second investigation 
(200), Form I was given twice pre- 
operatively and twice postopera- 
tively and Form II at the fifth ses- 
Sion. While both operated and con- 
trol groups manifested fairly con- 
Sistent increases in IQ, the operated 
groups gained less (statistically not 
Significant) than the control groups. 

he apparent increment in intellec- 
tual ability can thus be attributed to 
Other factors, most likely practice. 


This effect also is evident in Form 


— 


II of the Wechsler-Bellevue test, al- 
though to a lesser extent than in 
Form I. The question now arises as 
to the possibility that the decrements 
reported in the other studies might 
be of greater magnitude had control 
groups been available for compari- 
son. Information concerning possible 
practice effects are particularly neces- 
sary to evaluate the McCullough 
(191) and Rylander (238) studies, 
which involved more than one retest. 
In retesting the same patients nine 
months postoperatively, Petrie (213) 
reports that the significant loss in 
IQ persisted, suggesting that with 
sufficient time interval between re- 
tests, practice effects may be mini- 
mized. 

Notwithstanding the possible prac- 
tice effects, all but two of the studies 
in which data are presented, show 
some decrement in postoperative 
performance in comparison to pre- 
operative IQ or control group scores. 
Some investigators who employed 
other intelligence test batteries also 
report postoperative losses in IQ (13, 
24, 172). These studies, while not 
entirely conclusive, strongly suggest 
that intellectual impairment meas- 
urable by means of standardized test 
techniques does occur following ther- 
apeutic destruction of frontal lobe 
tissue. 

With regard to preoperative level 
and severity of illness, several factors 
stand out which should be considered 
in further studies. The higher the 
preoperative intellectual level, the 
more likely a large or significant 
decrease postoperatively (197, 214, 
238, 304). In testing à hypothesis 
dealing with increased variability in 
the behavior of posttopectomy pa- 
tients, Wittenborn and Mettler (299) 
found that ten psychotic patients 
with relatively high preoperative 
Wechsler-Bellevue scores tended to 
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TABLE 1 


SUMMARY OF STUDIES PRESENTING PRE- AND POSTOPERATIVE IQ's 
FOR PSYCHOSURGERY PATIENTS 


Mean Time Mean 
Diagnostic Surgical Preop- of 
Author N Category Technique Test erative Post- x Remarks 
IQ test nge 
Porteus & 18 Psychotic lobotomy SB 83.9 varied — —3.2 modified Stanford 
Kepner Binet 
(220) 
Rylander 5 Neurotic lobotomy SB 116.2 varied  —11.6* calculation by thes 
(238) authors 
Strom-Olsen 11 Psychotic lobotomy SB 94.3 6wks. —2.2 calculationby Crom 
et al. (262) (67 
Yacorzynski 1 Psychotic lobotomy SB 118 3mos. —21 pt. received two pre 
et al. (304) WB 104.5 3mos. —17.5 operat.ve WB's 
Koskoff et 5 Normals with lobotomy WB 87.2 3mos.  —20.4* f 
al. (172) intractable 
pain 
Malmo (197) 6 Neurotic Slobotomy WB 107.8 1-3mos. —8.3* WB Form II on rè 
1 gyrectomy test 
McCullough 10 Psychotic lobotomy WB 83.4 2mos. 2.9 mean IQ change cal 
(191) culated from 2o 
postoperative test 
Petrie (214) 20 Neurotic lobotomy WB 105.8 2-3mos.  —5.0* 
Vidor (272) 21 Neurotic& lobotomy WBor 111.5 varied 0.6 16 pts. received SB 
Psychotic SB 5 pts. received WB 
King (199) Psychotic topectomy WB 101.7  3wks. 3.9 surgical group gained 
3.7 points 
than control group 
Sheer et al. 20 Psychotic misc. WB 79.3 6mos. 3.4 WB Form ll o 
(200) < retest. Surgical 


group gained 44 
points less that 
control group 
MEE ULL 30A 05025 Be ot EMAIL S A eee E 
* Significant at .05 level of confidence. 


decrease more than four controls with 
similar initial scores. Fernandes (76), 
although giving no data, mentions 
that patients obtaining higher pre- 
operative scores tended to drop while 
those with low scores tended to in- 
crease. In Table 1, the patients who 
obtain the higher IQ's are, for the 
most part, diagnosed as neurotic 
rather than psychotic. It seems likely 
that more accurate and valid esti- 
mates of intellectual ability can be 
obtained with neurotics, so that the 
impairment that occurs following 
psychosurgery is more readily ob- 


served in these patients. The investi- 
gation of patients lobotomized for 
relief from pain, moreover, reveals à 
large postoperative decrement (172). 
To what extent the severe pain inter- 
fered with optimal functioning is not 
known, but the postoperative relief 
apparently was not sufficient to over- 
come the loss due to brain lesion. A 
complicating factor in studying such 
populations is the fact that many of 
these patients are approaching death 
rapidly. A nonoperated control group 
or at least repeated examination 
seems essential. 
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With regard to the Verbal and Per- 
formance scales of the Wechsler- 
Bellevue, higher Verbal than Per- 
formance losses have been reported 
(153, 172, 214, 304). The differential 
effects on Verbal and Performance 
tasks, however, cannot be evaluated 
without a control group for compari- 
son. Abilities measured by perform- 
ance tests may be as impaired as 
verbal functions following psycho- 
surgery, but the impairment may be 
obscured by improvement due to 
differential practice effects. Such im- 
provement also makes it difficult to 
evaluate the permanence of any 
changes, although  Petrie's study 
(213) does suggest the possibility of 
permanent impairment. Social im- 
provement does not seem to bear a 
relationship to intellectual change as 
measured by standardized intellec- 
tual tests (199, 200). 


Miscellaneous Cognitive Functions 


In dealing with the more specific 
aspects of cognitive ability, such as 
planning ability, abstract ability, 
memory, learning, attention, etc., the 
reported findings become consider- 
ably more difficult to evaluate be- 
cause of the variety of tests em- 
ployed which presumably measure 
these functions. In examining these 
functions by means of the specific 
tests employed rather than the pur- 
ported specific functions, the confu- 
sion is alleviated somewhat, and it is 
possible to suggest trends. 


Porteus Mazes 


_ Despite the susceptibility to prac- 
tice effects of a performance test such 
as the Porteus Mazes, impairment 
below the obtained preoperative level 
Is revealed in many studies on the 
first postoperative examination (197, 
199, 200, 214, 220, 221). The time of 


the first postoperative test varies con- 
siderably among and occasionally 
within studies, but the immediate ef- 
fects of surgery probably are not in- 
volved. Crown (67), reanalyzing 
Porteus and Kepner's and Porteus 
and Peter's (220, 221) data, reports 
that 23 patients whose first postoper- 
ative tests took place after three or 
more months show a highly signifi- 
cant decrement. Preoperative tests 
were not administered by Robinson 
(83), but a significant 3.3-year differ- 
ence in favor of a socially improved 
nonoperated psychotic control group 
over the lobotomized group was 
found. Two studies (153, 262) show 
insignificant improvement following 
psychosurgery. The group Jones 
(153) employed, however, obtained 
almost a minimal score preopera- 
tively. 

A summary of the data of several 
studies in which the Porteus Mazes 
were administered pre- and postoper- 
atively is presented in Table 2. Only 
the changes in the first postoperative 
examinations are given. Generally, 
additional examinations result in 
gains which eventually reach or ex- 
ceed the preoperative level, but Sheer 
et al. (200) point out that the oper- 
ated group gained less from practice 
than did the nonoperated controls. 
In the first Columbia-Greystone 
study (199), one year posttopectomy 
the surgical group had attained the 
same mental age level as the nonoper- 
ated controls. In Petrie's nine 
months postoperative study (213), 
the lobotomized patients did not re- 
gain the immediate postoperative loss 
completely, but the difference was no 
longer significant. Possibly the prac- 
tice effect was reduced by the six- 
month interval between tests. 

Porteus and Peters (221) and King 
(199) believe that a pattern of dis- 
tinct loss on immediate postoperative 
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TABLE 2 
Summary or STUDIES GIVING PORTEUS Mazes TO PsYCHOSURGERY 
PATIENTS PRE- AND POSTOPERATIVELY 


Preop- 


s; ^ / : z Mean 
Diagnostic Surgical erative Time of 
Author N Category Technique peg ul ee Remarks 
MA nge 

Jones (153) 24 Psychotic lobotomy 5.0 3 wks. 3.0 

Koskoff et al. 3 Normals with lobotomy notgiven 3mos. —4.1 
(172) intractable 

pain 

Petrie(214) 20 Neurotic lobotomy notgiven 3mos. —1.8* 

Porteus & 72 i lobotomy 11.3 varied —1.7* combined data ana 
Peters (221), lyzed by Crown (67) 
Porteus & f 
Kepner (220) D 

Strom-Olsen 11 Psychotic lobotomy 12.0 6 wks. 0.1 calculated by Crown, 
et al. (262) (67) 

King (199) 19 Psychotic topectomy 13.2 3wks. —1.2 surgical group mean 
MA is 2.2 yrs. be 
low control 

Sheer et al. 23 Psychotic misc. 10.8 10days —1.5 immediate p-eopera- 
(200) tive MA (nd pre 


test) is presented 


| 


* Significant at or beyond .05 level of confidence. 


testing followed by gains up to or be- 
yond the preoperative level in sub- 
sequent examinations is related to 
social recovery. This observation 
merits further investigation. 


Abstract Thinking 


Studies which report impairment 
on standardized intellectual tests also 
find impaired abstract ability (197, 
214, 238, 304).  Rylander (238) 
gained this impression from inter- 
pretation of proverbs and fables and 
the definitions of certain abstract 
words. Petrie (214) similarly sup- 
ports this view through the use of 
the Stanford-Binet proverbs. In ad- 
dition, however, several studies which 
report no over-all intellectual im- 
pairment do report some decrement 
in ability to think abstractly (78, 105, 
160, 199, 200). 

Studies employing the  Capps 


Homograph test consistently report 
a decrement in the ability to shift. 
Malmo (197) reports a slight but 
definite impairment. The first Co- 
lumbia-Greystone project (199) found 
a statistically significant decrease ten 
days posttopectomy. While most pa- 
tients regained the original loss, the 
decrease in the ability to make verbal 
shifts was not regained after one year 
in six of the 19 subjects. In the sec- 
ond Columbia-Greystone investiga- 
tion (200) the definite loss was con- 
firmed at the ten-day postoperative 
test. 'The loss was regained at the 
end of 30 days owing to reacquisition 
of identical definitions originally 
given but lost postoperatively. Such 
studies suggest that psychosurgery 
results in a definite, but probably 
transient, deficit in verbal ability to 
shift. 

Various sorting, grouping, and 


design tasks are less consist- 
ently reported to reveal loss due to 
Eo surgery. Employing the 
in-Scheerer tests with 42 lo- 
"botomized patients, Atwell (23) found 
only slight impairment. Grassi (105) 
found that patients who showed no 
cw intellectual ^ impairment, 
‘ a temporary reduction in 
‘achievement on his Block Substitu- 
Test. This decrement was al- 
Most completely eliminated at the 
id of a year. Practice effects were 
considered. Freudenberg and 
R n (85) report a significant 
"postoperative loss on the Kohs 
I . The operatees also gained 
"significantly less than did controls on 
La Sorting test. With the modified 
Kohs Blocks, Weigl color-form, and 
he Halstead Object-Sorting tests, 
Ris (160) concluded that some im- 
ent does occur postoperatively. 
While King (199), employing a 
ride number of measures of abstrac- 
tion, failed to show any posttopec- 
my group changes, certain patients 
did show abstraction difficulties post- 
'operatively which suggested the pos- 
Sibility that such deficit might be 
related to specific areas of the cortex 
excised. In the second project (200), 
E deficit on a modified 

/ est at the ten-day retest was 
evident, with the loss regained by 
- most patients at the three-month re- 


_ Some improvement on the color- 


form sorting test is reported by Jones 
(153), but no controls were used to 
.Setermine the amount of gain at- 
E to practice. Hunt (81) and 
NS m-Olsen et al. (262) report no 
lange in pre- and posttesting with 
Kohs Blocks while studies em- 
, E the Shipley-Hartford con- 
i ntly report no impairment post- 


— Operatively (14, 79, 228, 262). 
e 
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Memory, Learning, Attention 


Although memory and related 
functions have been found to be ex- 
tremely sensitive to impairment asso- 
ciated with brain pathology in clini- 
cal situations, the reported investiga- 
tions of individuals undergoing psy- 
chosurgery have not been fruitful. 
The gross attention and concentra- 
tion difficulties characteristic of emo- 
tionally disturbed persons probably 
interfere markedly with preoperative 
assessment of these functions. Some 
studies simply report no gross or 
permanent changes postoperatively 
(13, 14, 194, 238). Extremely varied 
results on rote memory tasks are re- 
ported by many investigators (24, 
153, 191, 197, 228). Thus, Freuden- 
berg and Robertson (85) report sig- 
nificant memory loss postoperatively 
in a group of 24 lobotomized patients 
on paired associates and recall of the 
Bender-Gestalt figures, but no im- 
pairment in memory for objects, per- 
haps because of the greater concrete- 
ness of the latter task. A fairly com- 
prehensive investigation of memory 
and learning was undertaken by 
Stauffer (199). Control and topec- 
tomy groups learned semimeaningful 
and meaningful paired-associate lists 
and a paragraph of verbal directions 
preoperatively. In general, retention 
of previously learned material and 
the ability to learn new material was 
unaffected by topectomy. 

King (199) also included in his 
study of intellectual functions, a con- 
tinuous-problem task which utilized 
an instrument intended for selection 
of pilots by the Army Air Forces. 
The test involved a complicated 
choice reaction which permitted ob- 
taining a measure of "the patient’s 
ability to perform on a task requiring 
close attention and sustained effort 
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over a period of time." While the 
topectomy group suffered no marked 
impairment on this task, the control 
group manifested a greater trend in 
the direction of more problems solved 
and fewer errors than did the oper- 
ated group. Robinson (83, 228), test- 
ing the capacity for prolonged atten- 
tion and deliberation by means of 
simple arithmetic problems, rhym- 
ing, and three of the Downey Will- 
Temperament Tests, reports clear- 
cut deficit in the lobotomized group 
as compared to nonoperated controls. 
Malmo (197) also found that de- 
liberation was reduced postopera- 
tively as indicated by more rapid per- 
formance on Raven's Progressive 
Matrices Test. Reduction in time 
and score on the Matrices Test as 
well as very rapid performance on 
the MMPI postoperatively was ob- 
served by Vidor (272). In the Colum- 
bia-Greystone project the topecto- 
mized group showed a trend toward 
poorer performance in addition tests, 
but not significantly different from 
the controls. A subtraction test 
showed significant changes in vari- 
ability attributable to the operation. 
On a cancellation. test, Rylander 
(238) reports a decrease one month 
postoperatively with subsequent 
gains to preoperative level. Hunt 
(81), however, reports postoperative 
increased time with improvement in 
accuracy on a cancellation test, as 
well as improvement in immediate 
memory. 


Sensory Functions 


The relief of intractable pain by 
means of psychosurgery is appar- 
ently afforded without increasing the 
threshold for pain. Using ordinary 
clinical tests in neurological examina- 
tion of such patients, Watts and Free- 
man (274) failed to disclose any evi- 
dence of impaired sensation. Indeed, 


Chapman, Rose, and Solomon (55) 
report increased withdrawal reactions 
to pin prick are not uncommon in 
lobotomized patients. Following up 
this clinical impression and employ- 
ing the Hardy-Wolff-Goodell ap- 
paratus for more accurate evaluation, 
they found that postoperatively their 
group of 23 psychiatric patients with- 
drew their heads from the apparatus 
at less intense levels of stimulation 
than before lobotomy. Over a two- 
year period, consistent trends toward 
a return to preoperative threshold 
levels were revealed (56). Malmo 
(197) also found a much higher rate 
of withdrawal after lobotomy. King 
et al. (158) supported these findings 
in a study of five patients operated 
unilaterally for relief of pain. In the 
second Columbia-Greystone study 
(200), slightly reduced thresholds were 
observed in only two of the nine oper- 
ated patients. In this study, the 
radiant heat was applied to the fore- 
arm rather than the forehead. 

Extensive psychophysiological 
studies were employed in both Co- 
lumbia-Greystone projects. No 
marked losses or consistent changes 
were found in auditory acuity, visual 
acuity, peripheral vision, brightness 
discrimination, color vision, time 
judgment, autokinetic effect, critical 
flicker frequency, recognition of ta- 
chistoscopically exposed stimuli, or 
various motor tasks. 


Personality 


In his review of the clinical studies 
of lobotomized patients Crown (67) 
points out that “almost invariably 
. . . personality changes have fol- 
lowed the operation.” These changes 
are in the direction of increased cheer- 
fulness, complacency, apathy, rest- 
lessness, shallowness of affect, indif- 
ference to criticism and feelings of 
others, and decreased self-conscious- 
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ness, reserve, and tact. Kolb (171) 
also points out that there is little 
difference of opinion among clinicians 
as to the nature of the personality 
changes following lobotomy. 

Rating scales have been employed 
in several studies (153, 226, 244), 
which generally found improved ward 
behavior. Cooperativeness, sociabil- 
ity, and tidiness are increased; anxi- 
ety, depression, and bizarre behavior 
are decreased. Affect and feeling are 
adversely affected in the direction of 
greater apathy (226). Lack of initia- 
tive is also noted (244). In a series of 
ratings, Jones (153) indicates that for 
the group, over-all behavior did not 
improve after the eighth week post- 
operatively. A time sample of be- 
havior was obtained for surgical pa- 
tients alone and in pairs by Kinder 
and Willenson (200) with no changes 
in patterns of behavior apparent fol- 
lowing surgery. Patients under 40 
showed some increase in activity 
while those over 40 manifested a de- 
crease. Bockoven and Hyde (106) 
observed 16 patients pre- and post- 
lobotomy in groups, recording socio- 
grams of the patients’ interactions. 
Improvement in psychiatric cases 
was generally accompanied by in- 
creased socialization and develop- 
ment of a friendly democratic atti- 
tude toward other patients. It is 
possible that increased familiarity 
With observers and examiners and 
the additional attention may lead to 
improvement and reduction in anxi- 
ety. 

'The necessity for control groups is 
Suggested by the findings of Witten- 
born and Mettler (299). Employing 
yom rating scales, no significant 
oo between the topectomized 

nd control groups were found; both 
groups manifested a reduction in 
Symptomatology. Indeed, a return of 
certain pathological symptoms was 


noted in some surgical patients who 
had been free from such pathology 
prior to topectomy. 

Noticeable lessening of psychotic 
symptomatology is reported for lo- 
botomized patients in studies employ- 
ing personality inventories (14, 81, 
199, 272, 304). Standardized inter- 
views utilized to obtain attitudes 
dealing with feelings of guilt, religion, 
sex, and prejudice failed to reveal any 
over-all attitudinal changes (200). 
Employing a sensibility question- 
naire to measure degree of concern 
with one’s past and future and with 
the opinions of others and a self- 
regarding span (a measure of the 
time spent talking about oneself), 
Robinson (83) compared a loboto- 
mized group with a nonoperated con- 
trol group and found that the oper- 
ated patients showed less self-preoc- 
cupation, less concern with the past, 
future, and opinions of others. Thus, 
for the most part the feelings and at- 
titudes expressed by patients under- 
going psychosurgery are in accord 
with the clinical observations and 
ratings made of their behavior. 

In summarizing the Rorschach 
data for the first Columbia-Greystone 
project, Zubin (199) points out that 
some subjects showed altered per- 
sonality trends, but that no definite 
patterns of changes emerged. Analy- 
sis of group changes revealed a. pro- 
nounced decline in reaction time for 
the operated patients. Suggested 
trends were posttopectomy decreases 
in number of responses and factors 
primarily associated with anxiety, 
ambitiousness, conflict, introspection, 
and perceptual accuracy. Atwell (23) 
also reports increased constriction, 
perseveration, and stereotypy along 
with less spontaneity, initiative, and 
fantasy. The most marked postoper- 
ative change in Atwell's study was 
the patient's carefree and uncon- 
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cerned approach to the Rorschach. 
Similar changes in Rorschach factors 
also emerged in studies employing 
fewer subjects (86, 304). Jones (153) 
found àn increase in responses, but 
taking into account the initially con- 
Stricted record and possible practice 
effects of repeated testing, these re- 
sults do not contradict the above- 
mentioned studies. 

Some retest changes apparently 
may occur regardless of surgery ac- 
cording to Wittenborn and Mettler 
(299). Employing a novel Rorschach 
measure (lack-of-perceptual-control 
Score), revealed by responses in which 
form is absent or secondary to color 
or shading, they found a significant 
difference between topectomized pa- 
tients and controls. Topectomy pa- 
tients increased in “lack of percep- 
tual control” postoperatively while 
control patients showed a decrease in 
- measure. 

n à comparison of pre- and post- 
lobotomy Rorschach feo of 40 pa- 
tients, Hunt (81) reports greater con- 
Striction two weeks postoperatively. 
Somewhat contradictory to the other 
studies are findings of increased popu- 
lars and testing time and decreased 
perseveration and  selí-references. 
The patients also manifested less re- 
luctance in their approach to the 
task, less self-criticism and concern 
over performance. Amaral (13) re- 
Ports an almost identical picture 
postoperatively for 18 patients. 

n à study on creative ability, 
Hutton and Bassett (147) indicate 
that the Rorschach, the Harrower- 
Erickson Multiple-Choice Rorschach, 
à story-telling, and a drawing test re- 
veal a lessening of creative ability in 
leucotomized patients. Few patients, 
however, manifested a creative urge 
prior to the operation. In a later 

study, Ashby and Bassett (21) em- 
ployed a drawing test and found no 


difference between operatees 
psychotic controls. Both patient 
groups did worse than normal control 
subjects. 
Ashby and Bassett (22) studied the 
psychogalvanic response of 21 lobot 
omized patients and 21 controls 
real and symbolic (unfulfilled) threat, 
Six patients were also studied pre 
and postoperatively. No uniform 
postoperative trends emerged; 
symbolic threats retained the po 
of eliciting the psychogalvanic 
sponse. Thus, the authors conclude 
that emotional drive is not dimii 
ished by lobotomy. 
Petrie (214) reports a postopera 
tive decrease in neuroticism as mani 
fested by reduced body sway sug 
gestibility and a smoother work c 
in a neurotic group. Diminished in- 
troversion was also evident as 
vealed by loss in persistence, a te 
ency to go for speed rather than 
curacy, reduced self-blame, greai 
reality adjustment on level-of-aspira- 
tion tests, and a tendency to live in 
the present. | 
In general, personality studies d 
suggest a decrease in depression an 
anxiety following psychosurg 
These changes seem to occur at 
expense of greater personality con- 


ards and regard for others. Eval 
tion of the presence of “organic” i 
dicators is obscured by their prese 
in preoperative Rorschach records 
(161, 271) or by the overlap with 
postoperative psychotic residuals. 


CONCLUSIONS 


The last decade has witnessed 
significant change in the orientati 
of research in the field of orga 
in his earlier review, had concluded 
that psychological studies in this 
field had been concerned primarily 
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with relating mental functions to 
localized areas of the brain, the de- 
termination of the presence of brain 
pathology on the basis of psycho- 
logical test performance, and the 
definition of an “organic psychologi- 
cal syndrome." Methodologically, 
essential reliance had been placed 
upon the use of omnibus test tech- 
niques such as the Stanford-Binet and 
Rorschach. However, the beginning 
of a trend toward the use of more 
specialized test techniques had been 
observed. 

The period covered by the present 
review reveals decreased emphasis 
upon localization and diagnosis and a 
concomitant increase of interest in 
the developmental aspects of brain 
injury, emphasis upon patterns of 
functioning through the use of spe- 
cialized tests including laboratory 
methods, and an overly hasty con- 
centration of effort upon the psycho- 
logical consequences of psychosurg- 
ery. 

During the past ten years, in- 
creased theoretical sophistication and 
research findings have served to dispel 
the earlier optimism concerning the 
ability of psychological test tech- 
niques to contribute toward localiza- 
tion of brain pathology. The correla- 
tion of psychological test performance 
with specific areas of brain damage 
has been found to be limited by vast 
differences in brain pathology caused 
by different types of injury or disease 
as well as by serious limitations in the 
techniques of anatomical localization. 
In general, psychological instruments 
have proven incapable of differenti- 
ating patients with presumptive in- 
JRA to specific cortical areas. In- 

m the limited number of studies 
Which do report such differentiation 
Fd repetition or cross-validation, 

| apparent constant necessity in 
this field of research. 


The continued development of spe- 
cialized test methods reflects the in- 
ability of conventional psychometric 
techniques to reveal clear pictures of 
organic impairment. Thus, the con- 
siderable research employing the 
Wechsler-Bellevue scale reveals that 
although the test as a whole proves 
relatively insensitive for diagnostic 
purposes in patients with brain in- 
jury, certain of the subtests appear to 
be quite discriminating in numerous 
studies. In addition, the trend to- 
ward the use of laboratory perceptual 
techniques, such as the critical flicker 
fusion, the phi phenomenon, and the 
tachistoscopic presentation of stimuli, 
have opened additional horizons for 
study and merit concerted research 
exploration. These methods do reveal 
promise, but one is not yet able to 
evaluate their ultimate significance 
as differentiating techniques. 

A further significant development 
in recent years has been the rather 
extensive research upon children with 
organic brain disease. Analysis of the 
deficit findings in children with brain 
damage reveals that the impairment 
tends generally to parallel that ob- 
served in adults with brain damage. 
There is, however, the definite sug- 
gestion that children with brain 
pathology manifest marked uneven- 
ness and inconsistency in the develop- 
ment of their intellectual capacities, 
and this may indicate more general- 
ized cerebral localization of function 
in children. The research directed 
toward differentiating endogenous 
and exogenous feebleminded children 
has suggested promising new psycho- 
logical approaches and techniques in 
addition to making interesting theo- 
retical contributions. 

Finally, it is felt that subsequent 
research in this field should recognize 
the importance of the interaction of 
such related variables as environment 
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and premorbid personality of patients 
with organic brain damage. It is ap- 
parent that intellectual deficit must 
be evaluated in relation to the rich- 
ness and complexity of past and pres- 
ent environmental situations. In- 
deed, the tempo and degree of so- 
called deterioration in certain senile 
and organic conditions may be deter- 
mined significantly by the nature of 
social and other environmental fac- 
tors. [n addition, numerous questions 
arise that emphasize the need for 
additional research dealing with the 
matter of personality changes be- 
lieved to be associated with organic 
brain disease. For example, do exist- 
ing intellectual limitations. present 
the appearance of fundamental per- 
sonality alteration or are there basic 
qualitative changes in the premorbid 
drives, motives, and basic personality 
dynamics of patients following or- 
ganic brain insult? 

It is unfortunate that most studies 
of personality changes in organic 
brain disease have utilized the Ror- 
schach test and reliance upon test 
signs which have not been ade- 
quately cross-validated. A more 
fruitful evaluation might involve an 
approach aimed at an analysis of the 
particular social, familial, and per- 
sonal demands made upon the patient 
with organic brain damage. Follow- 
ing such an analysis of expected and 
desired behavior, specialized psycho- 
logical techniques might be employed 
to evaluate the capacities required to 
fulfill these environmental demands. 
Such an approach would appear to 
offer extremely vital information con- 
cerning the consequences of brain in- 
jury or ablation. At the present time, 
there is a striking absence of psycho- 
logical research designed to under- 
stand the altered dynamic field of the 
adult with brain injury. Similarly, 
there is a need for scientific investiga- 


tion of the relationship between 
variables as premorbid perso 
and socioeconomic and cultural m i 
in relation to mode of adjustment to 
brain disease. 
When one considers the literatu 
dealing with psychosurgery, the 
sults prove generally to be discon- 
certing. Although inconsistent and 
paradoxical results are observed v 
identical test techniques are 
ployed, there are some trends in thi 
results reflected in Tables 1 and% 
The varied conclusions in this area: 
research appear to be a consequenc 
of numerous kinds of errors. First, 
psychosurgery is an extremely bro 
term and subsumes a large number a 
different surgical procedures involv- 
ing varying degrees of destruction ol 
brain tissue. Second, when psycho 
metric and psychological test tech 
niques are utilized with severely p 
chotic patients, the reliability or rep- 
resentativeness of the results may be: 
questioned. Third, the results in the 
majority of the studies are not com- 
parable since different or heterogene: 
ous diagnostic groups of patients were 
employed. Finally, a large number of 
studies dealing with the effects of 


larly by absence of the use of ade- 
quate control groups. 
Despite the difficulties cited above, 
it is possible to extract some genera 
zations regarding the impact of thera- 
peutic psychosurgery. For example, 
in those patients whose preoperati 
test performance was not marked 
disturbed, there is some suggestion 
impairment in general intelligence, 
abstract thinking ability, memory 
functioning, learning ability, and sus 
tained attention. Qualitative evalt 
ation of personality changes appe 
to indicate a more apathetic, 
complex, and constricted individua 
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who shows less introspective concern 
with himself and less depression of 
mood following lobotomy. It is clear, 
however, that experimental knowl- 
edge of the sequelae of psychosurgery 
remains restricted and conflicting. 
Continued research is necessary in 
this field with subsequent emphasis 
upon experimental design and the use 
of control groups. 

The present review has attempted 
to organize and integrate the exten- 
sive literature dealing with organic 
brain damage over the past decade. 
The voluminous body of literature 
covered reflects adequately the de- 
gree of scientific preoccupation with 
this area of research. It appears that 


the change in emphasis from mere 
diagnosis and localization in the di- 
rection of the study of related vari- 
ables represents a  scientifically 
healthy reorientation. The introduc- 
tion of laboratory test methods repre- 
sents significant continued explora- 
tion of the utility of specialized tech- 
niques. It is felt, finally, that future 
research upon the patient with or- 
ganic brain disease should regard him 
as a complex individual whose social, 
economic, and intellectual environ- 
mental demands must be considered 
in order to attain total understanding 
of the specific consequences of brain 
pathology. 
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INTELLECTUAL FUNCTION OF THE TEMPORAL LOBES! 
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The clinical literature on the intel- 
lectual effects of human brain damage 
reveals a constant preoccupation 
with the problem of the role of the 
frontal lobes, with a corresponding 
neglect of other parts of the cerebral 
cortex. In particular, there is not a 
single systematic investigation of the 
effects of temporal-lobe damage in 
man, although there are several iso- 
lated and highly suggestive reports of 
individual cases. Fortunately the 
situation is quite different with re- 
gard to animal work, which in the 
last few years has yielded numerous 
reports dealing with the effects of 
temporal-lobe lesions of varying ex- 
tent on the learning ability of lower 
primates. This material draws atten- 
tion to types of deficit which might 
well be found at the human level also, 
but which have been neglected; for 
this reason the animal data will be 
presented in some detail, before pass- 
ing to a review of the clinical litera- 
ture. 

Since this review deals only with 
cognitive functions, there will be no 
detailed discussion of the emotional 
changes often seen in temporal-lobe 
damage. Certain salient facts should, 
however, be noted. In the monkey, a 
decrease in emotional reactivity regu- 
larly follows deep-temporal removals 
(7, 17, 60, 80, 88, 105); in man, elec- 
trographic abnormality in the an- 
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terior temporal region is frequently 
associated with personality disturb- 
ances (4, 81), and ablation of the 
temporal poles may cause extreme 
docility (38). It is now believed that 
such phenomena depend largely on 
damage to the amygdala, as removal 
of this area in animals has striking, 
though inconsistent, ^ emotional 
effects, causing savageness in the cat 
(8, 101) and tameness in the monkey 
in the studies mentioned above. The 
inconsistency is puzzling, but Mac- 
Lean (74) points out that Schreiner, 
Kling, and Galambos have recently 
described increased docility in cats 
also after bitemporal excisions, al- 
though this does not appear in the 
published abstract (97). Stimulation 
of the amygdala in waking animals 
has also been carried out by Gastaut 
(35) for the cat, and by MacLean and 
Delgado (75) for both cat and mon- 
key, and here there was no contradic- 
tion, well-organized rage responses 
being elicited from both species. 

Such observations form part of 
broader inquiries into the role of the 
phylogenetically more primitive parts 
of the cerebrum in emotion. Thus 
Gastaut (35), in an admirable review 
of recent work on the rhinencephalon, 
points out the rich connections and 
structural complexity of the amyg- 
dala, which could well make this à 
key area in any emotional regulatory 
System. This literature has also been 
surveyed by MacLean (73, 74), but 
with theoretical speculations whic 
are perhaps more elaborate than the 
data warrant. Further discussion 
would be out of place here. 
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REVIEW OF ANIMAL STUDIES 


The cognitive effects of temporal- 
lobe damage have been most clearly 
demonstrated in animal studies. A 
series of careful ablation experiments 
on the posterior cortex of the monkey 
has shown conclusively that the tem- 
poral lobes have important visual 
functions, and indeed that a major 
extrastriate focus for vision is to be 
found in this region. The experimen- 
tal evidence also suggests that the 
temporal lobes play an appreciable, 
though minor, role in tactual dis- 
crimination. 


Vision in Monkey after Removal of the 
Temporal Lobes 


The first evidence that temporal- 
lobe lesions affect visual functions 
appears in the work of Sanger Brown 
and Schafer (17), who in 1888 re- 
ported gross, though transient, dis- 
turbance of visual object recognition 
in monkeys after removal of the tem- 
poral lobes. However, the importance 
of this finding does not seem to have 
been realized at the time, and no 
formal testing was done. Recent 
interest in the topic stems from 
Klüver and Bucy's (60) dramatic 
demonstration in 1938 that destruc- 
tion of the temporal lobes in the 
macaque brings about a state of 
‘psychic blindness,” which they be- 
lieved to be similar to that of visual 
agnosia in man. Their animals no 
longer discriminated between edible 
and inedible objects, nor between 
neutral and previously fear-provok- 
ing. ones, and were markedly handi- 
capped on tests of form discrimina- 
tion; yet these disturbances could not 
be attributed to sensory loss, since 
the animals showed only slight upper- 
quadrant field defects and were ab- 
normally reactive, rather than unre- 
Sponsive, to visual stimuli. 

One interpretation (33) of these 


results attributes the deficit to acci- 
dental damage to area 19, areas 18 
and 19 being traditionally regarded 
as the visual association area, the seat 
of higher visual functions. However, 
the evidence is strongly against such 
an interpretation. Attempts to dupli- 
cate Klüver and Bucy's findings by 
extensive lesions within the prestriate 
cortex have failed. Large selective 
ablations of either the lateral or the 
mesial surfaces of areas 18 and 19 can 
be made with only slight visual dis- 
turbance (19, 64); and it has been 
claimed by Chow (21) that destruc- 
tion of all but the inaccessible poste- 
rior ventromedial sector of the pre- 
striate regiononly retards visual learn- 
ing in so far as the animal may have 
to adjust to an excessively restricted 
visual field. As against these nega- 
tive findings, the evidence for visual 
deficit following prestriate ablation is 
meager and internally inconsistent: 
Chow (19, 21) and Lashley (64) failed 
to confirm Ades's (1) report of am- 
nesia for visual habits following one- 
stage bilateral prestriate ablation. 
Lashley's own finding of deficit on a 
visual conditional reaction has not 
been corroborated by Chow (21), and 
disturbance of object recognition in 
the first few postoperative days was 
seen by Chow but not by Lashley. 
According to Riopelle and Ades (un- 
published study, cited in 92) there 
may be transient retardation of new 
learning, but there is no residual 
deficit. It is clear that the picture 
of mild, transient, and variable dis- 
turbance which emerges from these 
studies discredits the traditional view 
that the prestriate cortex is the major 
visual association area. Coupled with 
Klüver and Bucy's findings, it directs 
attention to the temporal lobes as a 
probable extrastriate focus of visual 
functions. 

Klüver and Bucy's temporal-lobe 
extirpations were large, and included 
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the hippocampus and amygdaloid 
complex bilaterally, sparing only the 
primary auditory area (and most of 
the optic radiations). Later work has 
confirmed that such extensive dam- 
age to temporal neocortex and allo- 
cortex suffices to cause profound vis- 
ual disturbance, even though care is 
taken not to invade the prestriate 
region (12, 80, 88, 90, 91). The dis- 
turbance has tended, however, to be 
less dramatic than that described by 
Klüver and Bucy, and only Mishkin 
(80) has reported persistent loss of 
object recognition, which is of course 
the essential feature of visual agnosia 
in man. In formal testing situations 
all animals show impairment on a 
variety of visual discrimination prob- 
lems after removal of the temporal 
lobes; the deficit is apt to be particu- 
larly severe on tests of pattern dis- 
crimination but there may also be 
disturbance of color (12, 19, 80), and 


even of brightness discrimination * 


(19, 80). 

When attempting to evaluate the 
effects of temporal-lobe lesions, two 
points should be emphasized. - First, 
the behavioral changes cannot be 
attributed to the slight and variable 
visual field defects that have been 
found, since much greater field de- 
fects may occur after parietal-cortex 
removal, but without producing such 
disturbance (12); the same is true in 
the case of extensive subtotal damage 
to the striate cortex (40, 59, 98). 
Secondly, the effects cannot be at- 
tributed to general intellectual deficit. 
The attempt to limit visual functions 
to the striate cortex, while ascribing 
visual deficits seen with extrastriate 
lesions to a nonspecific loss (65, 66), 
simply does not fit the facts. An ani- 
mal may succeed on auditory or 
somesthetic problems at a time when 
it shows severe visual loss, and there 
is evidence that temporal-lobe dam- 
age need not prevent good perform- 
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ance on delayed-reaction problem 
(52, 80), although in some cases it 
apparently does (12). Thus we art 
forced to conclude that temporal: 
lobe ablation has a selectively deti 
mental effect on visual learning and- 
retention. 


Chow (20) has been able to show 
the loss of previously learned visual 
discrimination habits after temporal: 
lobe removal is not due to the de: 
struction of specific memory traces, 
since in one out of two monkeys fur- 
ther training on other discrimination 
problems reinstated the original hab: 
its. (The training is considered imr 
portant, since no such dramatic re 
covery occurred in any of the three 
control animals.) But it is doubt! 

if anyone would defend the view that 
temporal-lobe ablation primarily de- 
stroys specific memory traces. i 
the contrary, most emphasis has been 
laid on the difficulty of new learning 
(60, 80, 90). In view of this, it 


learning, especially as one experimen- 
tal animal completely failed to lea 
a new, or to relearn an old, patte 
discrimination. It is true that the 
deficits he reports are in general mild, 
but this may well be due to the f. 
that his lesions spared the vent 
medial portion of the temporal lob 
an area which, aswe shall see later 
particularly important for vis 
functions. It should be added t! 
the small number of animals and th 
marked individual differences in per- 
formance further limit the inferences 
which can usefully be drawn from this 
study. ; 4 

Chow's own conclusion "that the 
temporal neocortex of monkey is not. 
a storehouse for visual memories, bul 
that it exerts some facilitative inflt 
ence on the neural substrate for visual 
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discrimination" (20, p. 436) is cau- 
tious enough. Lashley (66), however, 
has gone a step further in citing 
Chow's data as evidence against the 
view that the temporal lobes have a 
visual function. The argument does 
not seem convincing. It rests on the 
assumption that if there is no loss of 
certain specific visual memories and 
no clear-cut sensory defect (such as a 
restricted visual field or a total loss 
of color vision), then we must be 
dealing with some quite general im- 
pairment, such as a loss of compari- 
son attitude or a lowering of “‘vigi- 
lance." Sucha view does scant justice 
to the complex nature of visual per- 
ception; it also ignores clear experi- 
mental evidence that temporal-lobe 
ablation does not affect all discrimi- 
nation learning equally, but produces 
a marked deficit only on visual prob- 
lems. 
This deficit is admittedly hard to 
define. A visual disturbance which 
affects brightness, color, and pattern 
discrimination, while leaving object 
recognition and visual acuity intact, 
Cuts across the classical distinction 
etween complex “associative” func- 
tions and simpler sensory ones, and 
thus lends support to those who, on 
the basis of systematic studies of 
human brain damage, doubt the 
existence of pure disorders of visual 
integration (9, 10). It has been 
Pointed out that we cannot decide 
intuitively what constitutes a simple, 
and what a complex visual task (19, 
E If complexity is measured in 
€rms of difficulty for the intact ani- 
c then Mishkin (80) finds, and 
Ne and Ades (90) agree, that, 
r things being equal, the degree 
ol retardation on a series of visual 
‘crimination problems after tem- 
Poral-lobe ablation varies directly 
with the complexity of the problem. 
et we know that this is not so for all 
of visual problem. Riopelle and 


his associates (91), using patterned- 
string problems, found that the over- 
all level of performance was not sig- 
nificantly changed by removal of the 
temporal lobes, but that the order of 
difficulty of the problems was not the 
same for the normal as for the oper- 
ated monkeys. The data suggest that 
certain configurations which are per- 
ceptually misleading for the normal 
animal become less compelling after 
destruction of the temporal cortex. 

The fact that the same operated 
monkeys failed to form learning sets 
when tested with a series of different 
object-discrimination problems is a 
new and interesting finding (91). 
Apparently the monkeys found the 
three hundredth problem as difficult 
as the first, although some progress 
was always made on the six trials 
allotted to any single problem. These 
observations are clearly important, 
but interpretation is complicated by 
the fact that the operated animals 
never reached the same level of pro- 
ficiency as the normals on the individ- 
ual problems, so that the interprob- 
lem transfer effects for the two groups 
cannot be compared fairly. 


Visual Localization within the Tem- 
poral Lobe (Monkey) 


The experiments just discussed 
were undertaken in order to define 
more precisely the nature of the defi- 
cit seen after destruction of the tem- 
poral lobes. The question next arises 
as to whether a comparably severe 
visual disturbance may be produced 
by smaller lesions within the massive 
temporal cortex, or whether the only 
important factor is the amount of 
temporal-lobe tissue destroyed. 

Reviewing the literature with re- 
spect to the lateral surface, we find 
some inconsistencies. Kliiver and 
Bucy (61) emphasize the lack of 
gross visual disturbance following bi- 
lateral removal of either the first, sec- 
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ond, or third temporal convolutions, 
and Ades and Raab (2) report that 
damage to the lateral surface had no 
effect on a learned pattern discrimina- 
tion (they do not report the extent of 
the damage). Similarly, Mishkin 
(80) presents essentially negative re- 
sults from lateral-surface extirpa- 
tions, although he finds retardation 
in the mastery of new and difficult 
discrimination problems. However, 
Chow (19) describes somewhat more 
extensive impairment; after lesions in 
the region of the middle temporal 
convolution his monkeys showed 
deficits in retention and in subse- 
quent reacquisition of various visual 
discrimination habits. 

The effects of damage to the 
ventromedial surface of the temporal 
lobes are more consistent and more 
striking. The most systematic be- 
havioral study is that of Mishkin (80) 
who reports that animals are as 
handicapped on formal tests of visual 
discrimination after ventral-surface 
extirpations as they are after com- 
plete removal of the temporal lobes, 
the only difference being that with 
the smaller lesions the visual disturb- 
ance is not apparent to casual inspec- 
tion. Persistent impairment of form 
discrimination after damage to the 
“baso-medial” cortex of the temporal 
lobe is also reported by Poirier (88), 
thus confirming the importance of 
this area for visual functions. 

A further problem of localization 
is raised by the suggestion that the 
peculiar effectiveness of deep tem- 
poral-lobe lesions may be due at least 
in part to destruction of the hippo- 
campus (2, 60, 64). The hypothesis 
has been refuted by recent work. 
Mishkin (80) has data which indicate 
that where visual impairment follows 
removal of the hippocampus it is due 
to cutting of the white matter in the 
ventral surface of the temporal lobe, 
and not to destruction of the hippo- 


campus per se. The fact that severing 
the fornix, and so interrupting the + 
long projecting pathways of the 
hippocampus, has no effect on visu- 
ally guided behavior (34) strengthens 
this interpretation. 

The experimental evidence pre- 
sented so far has established the spe- 
cial importance of the ventromedial 
part of the temporal lobe for visual 
discrimination; the lateral section is 
implicated to a lesser degree, and 
minimal disturbance follows damage 
to the prestriate cortex. But this 
account is incomplete, in that it is à 
based on the effects of circumscribed 
lesions of temporal lobe or prestriate 
cortex. There is also evidence to show 
that posterior-cortex ablations which 
involve combined destruction of the 
prestriate and lateral-temporal neo- 
cortex are in fact exceedingly dam- 
aging to visual function. 

Lashley and Clark (67) point out ; 
that there is no anatomical justifica- 
tion for subdividing the posterior as- 
sociation area of the macaque neo- 
cortex into separate parietal, tem- 
poral, and preoccipital regions. It is 
also known that only minor be- 
havioral changes result from isolated 
destruction of any of these areas (2, 
64, 94). On the basis of such facts 
Blum, Chow, and Pribram (12) argue 
that the posterior neocortex, exclu- 
sive of sensory projections, should be 
treated as a unit, and that, as such, 
it must be ablated in its entirety if 
we are to discover its true functional 
significance. They accordingly made 
extensive lesions on the lateral aspect 
of the posterior cortex, involving 
damage to parietal, temporal, an 
preoccipital regions, and thereby pro 
duced a very marked loss of discrimi- 
natory ability. It is remarkable that 
such lesions cause at least as much 
visual impairment as does complete 
removal of the temporal lobes, de- 
spite the fact that the important 
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ventromedial area of the temporal 
lobe is left intact. This finding has 
since been amply confirmed (92, 107); 
it is also consistent with an earlier 
report of permanent loss of a pattern- 
discrimination habit after combined 
destruction of the lateral-temporal 
and prestriate regions (2). It should 
be emphasized that these severe and 
persistent disturbances of discrimina- 
tory function do not automatically 
accompany any large cortical lesion, 
but are specific to bilateral posterior- 
cortex extirpations; only mild and 
essentially transient effects follow bi- 
lateral prefrontal or extensive uni- 
lateral ablations (92, 107). It is inter- 
esting that very similar conclusions 
have recently been reached concerning 
complex visual functions in man (104). 
_ To recapitulate: it has been estab- 
lished that in the macaque visual 
learning ability depends on the in- 
tegrity of a small area of extrastriate 
Cortex, the ventromedial surface of 
the temporal lobe, while no small 
lesion elsewhere is crucial. Much 
larger destructions of lateral tem- 
poral surface and preoccipital cortex 
ve equal or greater effect. It ap- 
pears therefore that facilitation from 
the lateral-temporal-prestriate com- 
plex is required for the functioning of 
ventromedial focus, or, as an 
E statement, that the two 
Bea a single system in which the 
maller ventromedial area has a spe- 
cial importance, 


Somesthetic Functions of the Temporal 
Lobe 


While the ablation studies de- 
Scribed above leave no doubt as to 
| importance of the temporal lobes 
the evidence is far less 
PN E" in the case of somesthesis. 
infers E" that Klüver and Bucy (60) 
E from the behavior of their 
“Mimals that they were unable to 


gnize objects tactually after re- 


moval of the temporal lobes, and thus 
that their “psychic blindness" was 
not exclusively visual. There was, 
however, no loss of cutaneous sensi- 
tivity, and unfortunately no complex 
tactual and kinesthetic tests were 
given. At first sight the suggestion 
that temporal-lobe damage may in- 
terfere with somesthetic functions is 
a little disconcerting, since the pa- 
rietal lobe is the traditional locus of 
such functions, and the somesthetic 
importance of the posterior parietal 
lobule has long been recognized (94). 
Recently, however, Blum, Chow, and 
Pribram (12) and Blum (11) have 
shown that extension of parietal le- 
sions into the posterior cortex of the 
temporal lobe causes a greater somes- 
thetic impairment than do posterior 
parietal lesions alone; the loss was 
measured by means of an extensive 
test battery which included tactual 
latch-box and form-discrimination 
problems. These studies also show 
conclusively that in the monkey 
temporal-lobe ablation by itself is 
essentially without effect upon such 
tests; the most that could be demon- 
strated was a slight difficulty in the 
discrimination of roughness (12). It 
follows from these two findings that 
the temporal lobe is not focally con- 
cerned in tactual and kinesthetic 
functions, but that it can play a minor 
role, at least when the more impor- 
tant parietal area is excised. 

Blum (11) has already commented 
on the similarity between these find- 
ings and those of Evans (28) on tac- 
tual and kinesthetic deficit in cases of 
human brain damage; both human 
and animal studies suggest that dam- 
age to cortex posterior and inferior to 
the acknowledged somesthetic area of 
the posterior parietal lobule increases 
the sensory deficit. There is also 
evidence that greater and more en- 
during somesthetic deficit results 
from cortical ablations in man than 
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in chimpanzee, and in chimpanzee 
than in monkey (11, 94). These facts 
combine to suggest that some somes- 
thetic loss may be expected in man 
after removal of the temporal lobes, 
even without parietal injury. 
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THE PHYSIOLOGICAL PROBLEM 


As has been seen, behavioral stud- 
ies show beyond doubt that the tem- 
poral lobes mediate important visual 
functions; the question therefore 
arises as to the nature of the central 
nervous pathways involved. The 
first problem is to locate conduction 
pathways from the visual receiving 
area to the temporal lobes, which are 
able to survive extensive damage to 
areas 18 and 19. Data will be pre- 
sented which argue strongly against 
an explanation in terms of transcorti- 
cal systems alone, and which empha- 
size rather the role of the pulvinar in 
higher visual functions. A further 
problem is posed by the fact that 
similar visual deficit results from 
either small lesions on the ventral 
surface of the temporal lobes, or from 
much larger removals on the lateral 
surface of the posterior association 
cortex. 

Strychnine neuronography has dis- 
tinguished three zones within the 
temporal cortex of the macaque, and 
these differences appear to be corre- 
lated with differences in function. 
The three subdivisions are: the tem- 
poral pole; the supratemporal plane, 
together with the cortex on the supe- 
rior lateral surface of the temporal 
lobe; and the main temporal sector 
comprising the ventral and postero- 
lateral portions of the temporal lobe. 
It is this third section only which 
has been implicated in visual func- 
tions, and its organization is therefore 
of paramount interest here. What is 
known of the other two temporal 
regions may be briefly summarized. 
The temporal pole, an anatomically 
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distinct region, has connections with 
many areas, notably the orbitofrontal 
cortex and the amygdala (68, 89). It 
is chiefly concerned in autonomic and 
related somatic-motor functions (55, 
75), though recent electrophysiologi- 
cal studies reveal diffuse connections 
between the temporal tip and the 
posterior association nuclei of the 
thalamus, suggesting that the polar 
region has more in common with the 
main temporal sector than is gener- 
ally supposed (3). The second region, 
that of the supratemporal plane and 
adjacent cortex, contains the projec- 
tion field of the medial geniculate, 
and subserves audition. Th rough 
studies have been made of it: trans- 
cortical and intracortical connections 
(5, 102). 

The main temporal region has 
many connections with other cortical 
areas, the most suggestive from the 
standpoint of visual function being 
those to and from the prestriate cor- 
tex. Such connections have been 
found for both the posterior lateral 
and the inferior temporal regions (6, 
77, 87). In contrast to this it has 
generally been believed that there are 
few thalamic projections to this area 
(106), but such a view is no longer 
tenable. Both Chow (18) and Simp- 
son (99) have provided anatomical 
evidence for projections from the 
posterior part of nucleus pulvinaris 
medialis to the lateral surface of the 
temporal lobe, and Chow has also 
confirmed earlier reports of projec- 
tions from the posterior part of 
nucleus pulvinaris lateralis to the 
posterior temporal cortex. Further 
more, Pribram and Mishkin? have 
now found that, in the baboon, com- 
bined ventral surface and hippo- 
campal lesions produce degeneration 
in the ventral half of nucleus pulvi- 
naris inferior and also in the postero- 

3 Mishkin, M. Personal communicatio 
1953. 
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halí of nucleus pulvinaris 
` This is taken to indicate 
that Chow's original description of 
these projections, which he placed in 
the lateral temporo-occipital area, 
“should be extended ventrally. We 
"see, then, that both the inferior and 
‘the posterolateral portions of the 

1 lobes are rich in transcorti- 

— gal and thalamocortical connections. 
In trying to elucidate the role of 
this main temporal area in visual 
Y ions, we run at once into the 
following difficulty. It is known that 
—the striate cortex (area 17) has no 
"direct connections with the temporal 
lobes, but fires locally into area 18, 
which thus acts as a way-station for 
—transcortical impulses from the pri- 
“Mary receiving area (14, 22). Yet 
—we have seen that extensive damage 
. to areas 18 and 19 does not interfere 
greatly with visual learning; particu- 
~ larly striking is Evarts’ (29) demon- 
—stration that extensive ablation of 
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"area 18 has no effect on either reten- 
tion or postoperative learning of a 
. “conditional” problem requiring the 
association of visual and auditory 
Cues. An attempt to explain such 
| data purely in terms of transcortical 

_ Connections must assume that the 
Small ventromedial section of area 18 
Temaining can function equipoten- 
tially for the whole; this seems par- 
| ticularly unlikely when we consider 
That only E association fibers are 
F5 in the visual cortex (22), and 
D on the region of area 17 immedi- 
re tely adjacent to the intact remnant 
am tarea 18 in Evarts’ experiment rep- 
NI the most peripheral part of 
. the visual field (76). One must there- 
e look elsewhere for pathways to 

t for extrastriate visual func- 


` ES 

i y Ee t seems most likely that transmis- 

a n Cim subcortical relay and associ- 
C lon nuclei will provide the answer. 


*1 Support of this, recent physiologi- 
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cal studies by Jasper and his co- 
workers may be cited (53). Working 
with monkeys under local anesthesia 
and sodium pentobarbital sedation, 
Jasper, Ajmone-Marsan, and Stoll 
(53) found that local electrical stimu- 
lation of area 17 produced strong 
firing in a narrow zone of the lateral 
geniculate and simultaneous firing of 
the adjacent portion of nucleus pulvi- 
naris lateralis. In fact, their results 
suggest that after-discharge in area 
17 is conducted more readily over 
subcortical pathways than trans- 
cortically to the parastriate area. As 
there are projections from nucleus 
pulvinaris lateralis to the posterior 
temporal region, it seems that we 
have here a means of by-passing the 
prestriate cortex. Arguing against 
this view, Chow (21) points out that 
prestriate ablations may cause degen- 
eration in the lateral part of the 
pulvinar without there being any 
lasting visual deficit. However, in 
the one case of this kind reported, 
the locus of cell degeneration in the 
pulvinar was not coextensive with 
the area to which the striate cortex 
appears to fire (53), although admit- 
tedly there was some overlap. The 
evidence at present is not sufficient to 
settle the question finally one way or 
the other, but the apparent absence 
of adequate cortical connections in 
such cases is presumptive evidence for 
a subcortical relay of the type de- 
scribed. The fact that combined de- 
struction of posterior-temporal and 
prestriate cortex causes permanent 
visual impairment supports this view, 
suggesting as it does that in such 
cases both the transcortical and the 
subcortical pathways from area 17 
have been cut. 

It remains to discuss briefly the 
second problem posed by behavioral 
studies: how destruction of a small 
area on the ventromedial surface of 
the temporal lobes, or interruption 
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of some of the pathways serving it, 
causes deficits as severe as those seen 
with extensive damage to the lateral 
surface of the posterior association 
cortex. The behavioral evidence 
clearly shows that the ventromedial 
portion of the temporal lobe contains 
an important center for visual elabo- 
ration. Physiological studies have re- 
vealed an area on the ventral surface 
to which impulses from the surround- 
ing temporal cortex and from’ the 
prestriate area appear to converge, 
and which itself sends efferent im- 
pulses to the prestriate cortex (87). 
Ventromedial-surface extirpations 
could therefore be expected to inter- 
rupt many cortical circuits; it is 
probable that such lesions also dis- 
rupt circuits between temporal lobe 
and pulvinar. Blum, Chow, and 
Pribram (12) and Chow (21) have 
denied the importance of these sub- 
cortical connections, but the evidence 
does not entirely justify their posi- 
tion. 

The principal evidence cited by 
Blum, Chow, and Pribram is that 
there is no correlation between visual 
deficit and either extent or locus of 
degeneration in the pulvinar. How- 
ever, absence of retrograde degenera- 
tion in a thalamic nucleus is no final 
proof of lack of connections between 
that nucleus and the area of cortex 
destroyed. As Jasper, Ajmone-Mar- 
san, and Stoll (53) have pointed out, 
projecting thalamocortical fibers may 
have widely distributed collaterals 
which may protect the cells of origin 
from degeneration. This would ex- 
plain the negative results of degenera- 
tion studies following moderate-sized 
ablations in the posterior association 
cortex. Itis known that hemidecorti- 
cation causes complete degeneration 
in the pulvinar, while ablations of 
parietal or temporal cortex alone 
have only minor effects (23, 24, 106). 
A further point is that loss of func- 
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tion may also be caused by interrup- 
tion of corticofugal connections from 
temporal lobe to pulvinar. Such 
corticofugal connections have been 
suggested by Jasper, Ajmone-Mar- 
san, and Stoll in the study described 
above. 

The main conclusions from this 
discussion can be summarized as fol- 
lows. There is a focus for visual elab- 
oration in the ventromedial section of 
the temporal lobe, and it is reasonable 
to believe that lesions in this area 
disrupt important corticocortical and 
thalamocortical circuits. It has been 
argued that the deficit seen with ex- 
tensive lateral surface ablations is 
partly due to the severing of path- 
ways from area 17 to the temporal 
lobes; but there is no inconsistency in 
supposing that the lateral-temporal 
and prestriate areas are themselves 
implicated to some extent in the 
synthesis of visual perceptions. 


REVIEW oF HuMAN CLINICAL 
STUDIES 


In contrast to our present extensive 
knowledge of the deficits caused by 
temporal-lobe damage in monkeys, 
until recently we could only speculate 
as to the effects of such lesions in 
man. This ignorance was due in part 
to the clinical preoccupation with the 
frontal lobes, in part to a lack of 
adequate psychological testing when 
the temporal-lobe case was seen, an 
perhaps most of all to the unsatisfac- 
tory nature of the clinical material. 
The few psychological studies re- 
ported deal with cases of brain tumor, 
where it is notoriously difficult to 
assess the full extent of brain dam- 
age, and where increased intracrania 
pressure may produce disturbance far 
from the site of the primary lesion. 
Even in cases of penetrating head 
injury or of vascular accident, it i$ 
unlikely that damage will be neatly 
restricted to the temporal cortex, and, 
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failing autopsy, we can have only an 
approximate idea of the area of 
brain destroyed. The fact that bi- 
lateral temporal-lobe lesions in man 
are rare presents another obstacle to 
our understanding of temporal-lobe 
function, although the manifest non- 
equivalence of the two hemispheres, 
at least for language, suggests that 
unilateral studies may be more re- 
warding at the human than at the 
animal level. 

Despite a lack of precise informa- 
tion, the gross symptomatology of 
temporal-lobe disease has long been 
known (32, 57, 62). The personality 
changes which are a conspicuous fea- 
ture of human temporal-lobe pathol- 
ogy have already been mentioned. 
Sensory disturbances and complex 
visual hallucinations are equally 
striking symptoms and gain in sig- 
nificance when we consider that vis- 
ual impairment is the characteristic 
feature of temporal-lobe damage in 
monkeys. Finally, there is aphasia, 
which often accompanies lesions of 
the left temporal lobe, and which 
will require special discussion later. 

Among the sensory disturbances 
Seen in cases of temporal-lobe tumor, 
auditory anomalies such as tinnitus 
are the most common (32), presum- 
ably providing the most reliable 
localizing sign. The occurrence of 
olfactory hallucinations is said to 
imply involvement of the uncus and 
adjoining rhinencephalic structures 
at the base of the temporal lobe. 
Homonymous visual field defects are 
another frequent symptom and Cush- 
ing states that these are usually 
limited to the upper quadrant con- 
tralateral to the lesion and typi- 
cally show no macular sparing; such 
defects are taken to indicate damage 
to the optic radiations as they course 
Tound the inferior part of the lateral 
ventricle into the white matter of the 
temporal lobe (26). In agreement 


with this, Penfield and Rasmussen 
(86) find that cutting of the white 
matter below the cortex and five 
centimeters posterior to the tem- 
poral tip produces a homonymous 
hemianopsia; however, in these cases 
there appears to be sparing of the 
contralateral half of macular vision. 

As for the more complex visual dis- 
turbances, it has been known for some 
time that various phenomena such as 
deja vu, micropsia and macropsia, 
and formed visual hallucinations, 
may be among the earliest signs of 
temporal-lobe disease (32, 42), and 
indeed this finding led Hauptmann 
(42) to suggest that the temporal 
region might mediate important vis- 
ual functions. Moreover, in epileptic 
patients paroxysmal discharge within 
the temporal lobe may give rise to 
perceptual illusions or hallucinations; 
such states have been called “dream 
states" by Hughlings Jackson and 
“psychical seizures” by Penfield. It 
is interesting that in such patients 
electrical stimulation of the exposed 
temporal cortex may produce com- 
plex hallucinations, both auditory 
and visual, and sometimes appears to 
reactivate old memories. However, 
such effects are not elicited by stimu- 
lation of the temporal cortex in pa- 
tients whose epileptic foci are else- 
where (86). 

It is largely on the basis of these 
stimulation studies that Penfield has 
come to attribute to the temporal 
lobes a unique role in the preserva- 
tion of memory traces, but the situa- 
tion is far from simple. Certainly 
temporal-lobe removal in monkeys 
does not cause a general memory loss, 
but has a more specifically visual 
effect. 

It is now time to consider what has 
been explicitly stated about intellec- 
tual functions in temporal-lobe dis- 
ease. As was suggested earlier, the 
clinical literature of brain tumor 
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deals mainly with mental disturb- 
ance gross enough to disrupt every- 
day behavior. Summarizing the find- 
ings in such cases, Klebanoff (58) 
states that the presenting symptoms 
of intellectual deterioration are simi- 
lar to, but on the whole less severe 
than, those seen in frontal-lobe tu- 
mor; thus the patient may appear 
confused, and may show memory dis- 
turbance and inability to concen- 
trate. The fact that the onset of 
these nonspecific intellectual defects 
may be delayed in temporal-lobe 
cases has led to the view that they 
may be secondary effects due to in- 
volvement of the frontal lobes, an 
interpretation which clearly reflects 
the traditional view that the frontal 
lobes are the site of man’s intel- 
lectual functions. There is indeed 
a suggestion that in the temporal- 
lobe cases the memory loss is apt 
to be particularly severe, affect- 
ing both recent and remote events 
(62), but the fact that removal of the 
tumor by resection of the temporal 
lobe normally causes an abatement 
of the gross symptoms of mental 
impairment advises caution in draw- 
ing such a conclusion. 

In two studies where systematic 
mental testing was done following 
excision of temporal-lobe tumors, the 
aim was to study the function of the 
frontal rather than the temporal 
lobes. Thus Rylander (95) and Hal- 
stead (39) include temporal-lobe cases 
in their brain-damaged control 
groups, and both writers affirm that 
lesions outside the frontal lobes cause 
no impairment comparable with that 
found in their frontal-lobe cases. Yet 
the value of such assertions may be 
questioned. In the first place, Ry- 
lander deliberately excluded from his 
control group patients with damage 
to either the left-parietal or left- 
temporal lobes, on the grounds that 
in such cases surgical intervention 
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~ the right temporal lobe contributes 
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might greatly injure "the tools f. 

mental activity." This treatment of 
language as a mere tool is dubious 
enough, but the risks inherent in such 
a restricted choice of subjects are 
here heightened by the use of a test 
battery composed largely of verbal 
tests. If, as will be suggested later, 


mainly to the nonverbal aspects of 
intelligence, then Rylander has not 
adequately explored the probabli 
areas of intellectual deficit in his 
seven patients with right tempo 
lobe lesions. In Halstead's study, 
nine temporal-lobe cases are reported, 
but in only four of them (two left- 
sided and two right-) did the removals 
approach lobectomy. Of these, the 
two left-sided cases showed intellec- 
tual impairment at least equal to the 
average for the frontal-lobe group, 
while the right-sided cases gave nega- 
tive results. This suggests that Hal- 
stead's special tests are in fact sensi- 
tive to left temporal-lobe destruc- 
tion. 

From this point onward the effects 
of right and left temporal-lobe dam- 
age will be treated separately: first, 
because of the prevalence of language 
disturbances (aphasia) after lesions 
of the left temporal lobe; and second, 
because of the possibility, hitherto 
unproven, that right temporal-lobe 
lesions may be more apt than left to 
cause deficit on certain nonlanguage 
tests. 


Aphasia and the Left Temporal Lobe 


Precise information concerning the 
location of a speech area within the 
temporal cortex comes from electrical: 
stimulation of the exposed brain im 
epileptic patients. "This technique 
has revealed an area in the posteri 
temporal lobe where stimulation may 
either arrest speech or cause 
patient to become aphasic (86). Th 
anterior limit of this region is ap* 
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proximately seven centimeters from 
the temporal tip, and excision of all 
cortex anterior to it can safely be 
made without causing more than a 
transient aphasia. 

The literature of aphasia provides 
striking evidence that clean surgical 
destruction is less damaging than an 
active disease process. Thus Roberts 
(93) points out that the speech area 
in the posterior temporal cortex can 
be partially destroyed during the ex- 
cision of an epileptogenic focus with- 
out any permanent aphasia, pro- 
vided, that is, that the patient's seiz- 
ures are stopped. Similarly, Fox and 
German (31) and Neilson and Raney 
(82) comment with surprise on the 
comparative mildness of the residual 
aphasia seen following complete re- 
section of the left temporal lobe in 
cases of cerebral tumor, a finding 
which apparently contrasts sharply 
with the crippling effect upon lan- 
guage of even minor vascular lesions 
within the same area (82). The data 
are interpreted by these various au- 
thors as showing that the presence of 
partial damage within a speech area 
Causes greater language disturbance 
than does the surgical removal of a 
much larger area. This is a special 
instance of a principle which has ap- 
Plication beyond the field of language 
disorders: namely, that greater de- 
terioration may result from the pres- 
ence of abnormally functioning tissue 
than from mere absence of tissue 
(13, 45, 54). 

If we now ask what the special 
characteristics of temporal-lobe 
aphasia are, we get no one clear-cut 
E Roberts (93) carried out E 
(folio c study of 71 cases of aphasia 
E ing excision of cortex ad joining 
F areas in frontal, parietal, and 

mporal regions), and found no re- 
onn between either the severity 
E poo of the language dis- 

and the site of the lesion. 
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This conclusion provides support for 
those who, like Goldstein (36, 37), 
have been consistently opposed to 
the doctrine that different language 
functions are differently localized in 
the left cerebral cortex. However, a 
more traditional view is that the 
aphasia seen in posterior lesions is 
primarily “receptive,” (a disturbance 
of comprehension), as contrasted 
with the predominantly “expressive” 
disturbance seen with anterior le- 
sions; Weisenberg and McBride (108) 
explicitly accept this formulation, al- 
though they are careful to point out 
that there is always some expressive 
defect, making the difference one of 
degree only. Head (43), who lays 
little stress on receptive disturbances, 
attributes his syntactic aphasia to 
lesions of the temporal lobe: a more 
popular view associates the temporal 
region with the auditory components 
of speech, with difficulty both in un- 
derstanding and in retention of. 
spoken language (31, 38, 56, 83, 96, 
103). These studies as a whole permit 
the conclusion that in temporal-lobe 
aphasia speech comprehension is 
rather obviously impaired, but not in 
isolation from other aspects of lan- 
guage (96, 103). 

We come now to the important 
question of how far aphasia (and here 
I include temporal-lobe aphasia) can 
be regarded as a purely linguistic dis- 
turbance which leaves "intelligence" 
essentially unimpaired, and how far 
it implies a more general intellectual 
loss. There are three main schools of 
thought: according to one (of which 
Goldstein is the best-known repre- 
sentative), aphasia is merely a mani- 
festation of a more fundamental in- 
tellectual disorder (15, 36, 37, 110, 
111); according to another, there 
may be loss, which does not affect all 
aspects of intelligence equally, so 
that the patient will do very poorly 
on some tests and comparatively well 
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on others (31, 46, 108); or again, 
there is the third possibility that 
aphasia is a mere loss of symbols, by 
which it is implied that the aphasic 
patient will have difficulties of com- 
munication, but no intrinsic difficulty 
in problem solving (41, 50, 56, 70). 
Those who adopt this last position 
state that large cortical lesions also 
impair intelligence and that for this 
very reason such cases are not suit- 
able for studying the typical pattern- 
ing of abilities in aphasia (41, 56). It 
will be argued here that it is the sec- 
ond viewpoint for which there is the 
most empirical support. 

It is clear that the testing ground 
of these rival theories must be the do- 
main of nonverbal skills. Here 
Kennedy and Wolf (56) make the im- 
portant point that many tests pur- 
porting to be nonverbal are in fact 
not so, since the score depends upon 
how well the patient has understood 
the verbal instructions; hence, unless 
it can be shown in a given instance 
that the patient has grasped what is 
required, the work is invalid. This 
means, in effect, that the only un- 
equivocal results will be those cases in 
which the patient succeeds on the 
easy items of a test (showing that he 
has in fact grasped the instructions), 
only to fail later on more difficult 
ones (46). Unfortunately, not all 
tests used in studies of aphasia meet 
this criterion. 

If the results in the clinical litera- 
ture are taken at their face value, 
there is still ample proof that aphasia 
is compatible with normal perform- 
anceon some nonverbal tests. Meyers 
(78) finds that aphasic patients per- 
form as well as their matched normal 
controls on nonverbal tests of in- 
ductive reasoning of the multiple- 
choice kind. Similarly, von Kuen- 
burg (63) has failed to find clear-cut 
differences between aphasics and 
normal control subjects on sorting 
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tests calling for various classifications 
of colors and objects. Such studies 
are valuable in underlining the diffi- 
culty that many normal persons have 
on such tasks, a fact which is too 
often ignored in clinical studies. 
Nevertheless it would be idle to 
deny that aphasic patients often show 
impairment on nonverbal tests. Van 
Woerkom (110) demonstrated such 
deficits in individual cases many 
years ago, and later work has es 
sentially confirmed this finding. As 
was suggested earlier, the aim here is 
to show that the deficit is character- 
istically mild. Weisenberg and Mc- 
Bride (108) report that their aphasic 
patients consistently made better 
scores on nonverbal tests and on 
arithmetic than on verbal tests, but 
that these scores were still distinctly 
lower than would have been predicted 
in the case of a normal person with 
the same educational history. A 
similar picture emerges from Hebb's 
(46) study of six cases of aphasia fol- 
lowing surgical excisions in the domi- 
nant hemisphere, and here repeated 
testing showed that the recovery of 
verbal and nonverbal functions fol- 
lowed the same time course. Finally, 
in a report of special interest for this 
present study, Fox and German (31) 
present follow-up observations in à 
case of left temporal lobectomy for 
cerebral tumor: they report '"'aver- 
age" performance on a series of non- 
verbal tests at a time when the 
patient continued to do poorly on 
verbal and mathematical tests; a 
this time (fifteen months after opera- 
tion), the patient still became com 
fused if he listened to speech for any 
length of time, although the more 
overt signs of aphasia had disap- 
peared. These results are not incon- 
sistent with the view that there ha 
been some loss of nonverbal ability: 
since Fox and German consider that 
their patient was originally abové 
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‘average in intelligence, and thus as- 
sume that there had been some loss 
even in nonlanguage fields. It is of 
course evident that a major difficulty 
in the way of an accurate appraisal of 
all such data is the lack of any ob- 
jective measure of the patients’ in- 
tellectual status before the onset of 


It seems that the above studies, in- 
complete as they are, justify the view 
that in aphasia the pattern of test 
performance is likely to be one of 
"severe deficit on verbal tests and rela- 
à latively mild deficit on nonverbal 
ones, There is the further point that 
the degree of this disparity between 
Verbal and nonverbal scores varies 
markedly from person to person, and 
A the tests used. These differ- 
ences, which are very striking, are 
almost certainly related to the site of 
the lesion, but how, and to what ex- 
1 t, the data do not permit us to 
An additional factor may be 
the degree to which the individual 
— formally relies on verbal cues. 

— The studies outlined above show 
the typical patterning of test per- 
mance in cases of diffuse patho- 
logical damage to the left hemisphere, 
often involving the temporal cortex. 
£ tis reasonable to suppose that a 
clean surgical removal of the left 
temporal lobe (without brain tumor) 
would also affect verbal more than 
1 Nonverbal performance, but that the 

Wer-all deficit would be less severe. 


— Special Functions of the Right Hemi- 
= Sphere 


pom 
_ The absence of aphasia following 
"s in the right hemisphere has 
"ed to a neglect of possible intellec- 
te in such cases, and has prob- 
Bn ees v ha that the 
AN ral cortex plays an es- 
| sentially ancillary role i intellectual 
| Honing. It will be shown, on the 
E Contrary ; that the evidence suggests 
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that damage to the posterior associa- 
tion cortex on the right produces 
deficit different in kind from that 
caused by comparable damage on the 
left, rather than an attenuated form 
of the same disturbance. The data 
relating specifically to the right tem- 
poral lobe are relatively meager, but 
they are completely consistent with 
the view just outlined. 

The first suggestion that right 
hemisphere damage might cause its 
own characteristic deficit comes from 
Weisenberg and McBride's (108) de- 
tailed psychometric study of 22 cases 
of right-sided lesion without aphasia. 
Although these patients were slightly 
inferior to matched normal controls 
in occupational and educational level, 
this fact cannot account for the ir- 
regular nature of the test profiles ob- 
tained. Thus, while they approach 
the normals most closely on some 
verbal tests, they are significantly 
inferior on the arithmetic and most of 
the nonlanguage tests, notably the 
Porteus Maze Test. The authors con- 
clude that the performance-test bat- 
tery revealed a significant deficit in 
the appreciation and manipulation of 
formsand spatial relationships. These 
results are quite different from those 
obtained by the same workers with 
their aphasic group; the latter, it will 
be recalled, approached the normal 
most closely in nonlanguage work 
and arithmetic. We have no precise 
knowledge as to the extent of brain 
damage in the two groups, but 
Weisenberg and McBride explicitly 
state that they were ideally matched 
in this respect. However, it is not 
clear from the way the data are pre- 
sented that the absolute mean scores 
of the right-hemisphere group on 
nonlanguage tests were lower than 
those of the aphasic group. If they 
were nót, then we cannot conclude 
that the right hemisphere is more 
important than the left for the non- 
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language tests of Weisenberg and 
McBride's battery. All that has been 
shown is that these tests are more 
sensitive than verbal ones to right- 
hemisphere damage. 

Further evidence on this point is 
provided by Hebb (44), in the case of 
a patient who had had a complete 
right temporal lobectomy for the re- 
moval of epileptogenic scar tissue. 
This patient performed well on all 
verbal tests, with Stanford-Binet 
Vocabulary at Superior-Adult level 
and full-scale IQ of 113, but revealed 
a startling deficit on tests of form 
perception, both visual and tactual, a 
finding which is in marked agreement 
with the results of Weisenberg and 
McBride. Hebb also reports that his 
patient had difficulty in following 
conversations involving more than 
two or three people, so that he would 
often make inappropriate remarks. 
This is an interesting observation, 
suggesting as it does a type of im- 
pairment which neither a conven- 
tional test battery nor a clinical inter- 
view could be expected to reveal. 
This study is valuable in that it deals 
with the effects of a well-localized le- 
sion of one temporal lobe (although 
it should be added that there had also 
been a very small cortical excision 
in the precentral gyrus). Moreover, 
as the tests were carried out eight 
years after the temporal lobectomy, 
it seems safe to assume that the de- 
ficits observed were permanent. This 
study is unfortunately incomplete, in 
that there are no preoperative data. 
Yet it is worth noting that McFie, 
Piercy, and Zangwill (72) have de- 
scribed a similar picture of superior 
verbal intelligence combined with 
marked loss of ‘“visuo-constructive 
ability," in the case of a 36-year-old 
engineer with a glioma of the right 
temporal lobe. However, they appear 
to interpret their findings as implying 
concomitant damage to the parietal 
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lobe. In this instance the patient's 
profession seems reason enough for 
regarding the deficit as a direct result 
of the temporal-lobe tumor. 

With the exception of a recent 
study by the writer (79), this ex- 
hausts the evidence relating visuo- 
spatial abilities specifically to the 
right temporal lobe. There remain a 
number of studies stressing the im- 
portance of the right hemisphere, and 
more particularly of the right parieto- 
occipital cortex, for effective spatial 
organization. The symptoms said to 
be more common in right- than in left- 
hemisphere lesions include: disturb- 
ance of the coordinates of visual 
space (49, 69); poor performance on 
visuo-constructive tasks such as map- 
drawing and block design (27, 49, 
71, 72); and difficulty in putting on 
one's clothes, "apraxia for dressing" 
(27, 48). Such findings are of course 
not in accordance with the traditional 
view, which states that damage to the 
right parieto-occipital cortex merely 
causes neglect of the left side of the 
body and of the left half of the visual 
field (16, 25). Further, many of the 
cases on which the above reports are 
based involve massive, infiltrating 
tumors, so that bilateral damage can 
often be suspected. Yet this objec- 
tion does not dispose of all cases; the 
growing number of such reports an 
the diversity of their sources support 
the idea that there is in fact a sig- 
nificantly higher incidence of visuo- 
constructive disorders in right-hemr 
sphere lesions than in left. 


A Recent Series of Temporal-Lobe 
Cases ` 


This review may be concluded by 
summarizing the results of a recent 
study by the writer (79) of 25 cases of 
temporal-lobe operation by Penfiel¢ 
at the Montreal Neurological Insti- 
tute. The study included 13 right 
temporal cases and 12 left, together 


13 frontal and parietal cases as 
trol group. The operations were 
the relief of focal epilepsy caused 
atrophic lesions of the cortex 
ly dating from birth or early in- 
(85). All patients were given a 
ry of sixteen tests before opera- 
and again at the time of their 
arge from the hospital about 
weeks later, using equivalent 
of the same test wherever pos- 
. The operations were carried 
Out under local anesthesia, and Pen- 
"field attempted to reproduce the 
i ES. aura by electrical stimula- 

of the exposed cortex; the epilep- 
Mogenic area was then excised. It is 
important to note that the greatest 
ormality was usually found on 
e ventral and mesial surfaces of the 
poral lobe, and that the removal 
“tissue often extended to include 
le uncus, amygdaloid nucleus, and 
1 campus. 
The experimental and control 
— groups were well matched with re- 
pow to age, vocabulary, and occupa- 


Y io al status, as were also the right 
“and left temporal-lobe groups. The 
@verage age for all subjects was 28 
“years, with a range from 14 to 45. 
The mean score on Form I of the 
Stanford-Binet Vocabulary (1916) 
Was 28, which is slightly above the 
value for normal adults quoted by 
Weisenberg, Roe, and McBride (109). 
These facts are mentioned to show 
Mat we are not dealing with a senile 
_ Ora mentally retarded group. 
E" main results emerged. The 
E and most consistent was in- 
Fenority of the temporal-lobe group 
ES two visual tests (the McGill Pic- 
ture Anomaly [47] and the Wechsler 
E Arrangement), both of which 
E 4 ve complex picture material 
x ms with everyday social situa- 
4 5. This deficit was present before 
oo and therefore cannot be 
E. buted to visual field defects. 
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The second finding was the E T 
inferiority of the right temporal-lobe 
group on tests of spatial patterning, 
both visual and tactual. These differ- 
ences were present before operation, 
but were intensified after removal of 
the temporal lobe, a significant 
further loss being observed in the 
right temporal group on one test of 
block design. 

That there should be a visual de- 
ficit in human temporal-lobe damage 
is completely consistent with the ani- 
mal studies reviewed earlier in this 
paper, the only difference being that 
in the monkey there must be a bi- 
lateral lesion before any visual deficit 
can be discovered. However, in man, 
as in monkey, the precise nature of 
the deficit is not clear. We are justi- 
fied in concluding that it is specifi- 
cally visual and not a general impair- 
ment of “social intelligence,” since 
no defect was seen on verbal tests, 
such as the McGill Verbal Situation 
(47), which are also designed to 
measure some kind of social aware- 
ness. On the other hand it is remark- 
able that no visual tests, other than 
those mentioned, discriminated be- 
tween the temporal-lobe patients and 
the frontal and parietal, although the 
test battery included the Wechsler 
Picture Completion, the Benton 
Visual Memory, the C.M.M. Picture 
Analogies, and a shortened form of 
the Raven Progressive Matrices. It 
seems that the deficit appears when 
attention has to be given to many 
aspects of a complex picture, or when 
different pictures have to be arranged 
in meaningful order on the basis of 
slight differences of detail. Visual 
discriminations of this kind are well 
practiced in our culture, but they 
may be difficult to acquire initially. 
It is clear that much experimental 
work is needed before we can identify 
the factors making these tests diffi- 
cult for patients with temporal-lobe 
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lesions. The schematic nature of the 
drawings, the presence or absence of 
social anomalies, and the mode of 
presentation of the test may all prove 
to be important. 

The poor performance of the right 
temporal-lobe group on the Halstead 
Tactual Formboard (39) and on two 
visual tests of block design provides 
support for those who contend that 
the right posterior association cortex 
is more important than the left for 
the effective organization of space 
(48, 49, 71, 72). 

However, not all patients in the 
right temporal group showed this 
spatial disability; the most severe 
disturbance occurred in cases where 
the removal extended far back along 
the inferior surface of the temporal 
lobe. This may mean either that the 
posterior temporal region is more im- 
portant than the anterior for the per- 
ception of spatial patterns, or that 
the important factor is the size of the 
lesion in the right posterior cortex. 


SUMMARY 


In both monkey and man temporal- 
lobe damage can cause emotional 
and intellectual changes. The present 
paper has dealt only with the latter. 

'The main deficit is a visual one. In 
the monkey, bilateral removal of the 
temporal lobes severely hampers 
visual learning and retention, and in 
man even a unilateral lesion can im- 
pair the understanding of complex 
pictorial material. This loss is not an 
agnosia in the strict sense, nor is it a 
simple forgetting of learned dis- 
criminations. 

Selective ablation studies in the 
monkey have shown that there is a 
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"focus" on the ventral surface of the 
temporal lobe, damage to which is 
almost as detrimental to visual func- 
tion as is complete lobectomy. Simi- 
lar effects follow interruption of the 
transcortical and subcortical connec- 
tions of this region. In human 
epileptic patients, the temporal-lobe 
cases showing pronounced visual de- 
ficit are those with focal lesions in- 
volving the inferior and mesial as- 
pects of the temporal lobe. 

These observations naturally raise 
the question of the central nervous 


pathways implicating the temporal - 


cortex in specifically visual functions. 
It has been argued here that the data 
do not easily lend themselves to 
analysis in terms of transcortical 
systems alone, and that the possible 
importance of connections hetween 
the pulvinar and the temporal cortex 
should not be overlooked. 

Any evaluation’ of temporal-lobe 
function in man requires a further 
breakdown into left and right tem- 
poral-lobe lesions. On the left, lan- 
guage disturbances of varying sever- 
ity commonly follow lesions which 
interfere with the normal functioning 
of the speech area in the posterior 
temporal region, though it is pointed 
out that a clean surgical removal is 
apt to produce far milder effects than 
the continued presence of abnormally 
functioning tissue. On the right, 
there may be deficits of space per 
ception, visual and nonvisual. Suc 
deficits have also been found for right 
parieto-occipital lesions, so that We 
are justified in regarding the right 
posterior association cortex as Spe 
cially important for the perception of 
spatial patterns. 
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In a recent paper Sidman! has in- 
dicated a devastating criticism of 
a great deal of current and historical 
psychological research. He has 
demonstrated in the particular case 
of Hull's exponential growth function 
that if the functional relationship be- 
tween two variables, x and y, for 
any particular individual is 


y -f(2) 1] 


then the average y-value, 5, is of the 
form 


5 = g(x) [2] 


where g(x) may be a fundamentally 
different equation from f(x). He 
shows that if the learning curve for 
an individual is 


y= M — Me* [3] 


then the form of the function based 
on the average y is not necessarily 
the same as [3]. 

Sidman indicates that the kind of 
analysis he has made can be applied 
to any function. In this paper we at- 
tempt to generalize Sidman's results 
for any functional relationship, and 
to provide a criterion for deciding on 
the legitimacy of the averaging opera- 
tion as a device for making inferences 
Concerning individual functional re- 
lationships. 

The major tool that we avail our- 
Selves of is the Maclaurin series: 


Ae rae eo A note on functional rela- 
ined from group data. Psychol. 
Bull., 1952, 49, 263-269. is i 


x 
f(x) = f(0) + 2f' + 3:489 
: (4) 
x 
= op) +e 
+ rtd (0) + 
where f(x) is our function, f(0) is the 
value of the function when x=0, 
f'(0) is the value of the first deriva- 
tive of the function when x —0, etc. 
Thus, if 
y-f(aboimn 
for a particular individual, then 


a ee 
n 


[5] 


ex» [6] 


n 
Applying Maclaurin's series, we get 


x? 
y = yt yx wy 
(7] 
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The criterion which we offer to de- 
termine the legitimacy of the averag- 
ing operation as a basis for making in- 
ferences concerning individual func- 
tional relationships is as follows: If 
the coefficients Jo, Jo’, Jo”, etc. are 
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simply functions of the average 
parameters, d, b, č, etc., then the 
operation may be considered legiti- 
mate. When this criterion is satis- 
fied, the form of the average curve 
will be identical with the form of each 
individual curve. It will differ from 
the individual curves only in the 
parameters of the function. When 
the criterion is satisfied the parameter 
of the group function will be related 
to the parameter of any individual 
function in the same way that any 
mean is related to any individual 
score. If we restrict ourselves to 
functions which satisfy the criterion, 
we may then make inferences con- 
cerning the nature of individual func- 
tions from the nature of group func- 
tions, since the functions will have 
the same form. 
To demonstrate the application of 
- the criterion we will examine two 
functions 


y = ax? + be +c [9] 
and 
y=M— Me*. [10] 
If y=ax*+bx-+c, then 
youc 
y = 2ax+b yo =b [11] 
y" = 2a yo" = 2a. 


Therefore, applying [7] and [8], we find 
ysccbxzdax [12] 
for the individual, and 
j= e+ bx + ax? [13] 


for the group. Thus, the criterion is 

satisfied and we can consider the 

averaging a legitimate operation. 
If, however, y= M — Me-", then 


yo = 0 


y = Mke-** yo = Mk 


DAVID BAKAN 


y'= — Mic y” = — MP [M 
y"- Mc" yo” = ME 


Applying [7] and [8], we find 


x 
y= 0+ Mks- Mi 


x ü 
t pe C 
LET 
and 
5 = 0+ Wiz — (MR) E 
E 28 
fat: 
+ GF) = : 


In [16] the values of the coefficient 
are not simply functions of the aver 
age parameters, M and &, but rather 
averages of the products, Mk, Mi 2, 
etc., and therefore the averagin, 
process is not legitimate, as has bee! 
indicated by Sidman for this func 
tion. It will be apparent that th 
failure of the exponential grov 
function to satisfy the criterion re 
sults from the fact that is dependent 
not upon the parameters, as such, 
but upon a variety of products © 
the parameters; and the mean of 
series of products is not necessaril 
the same as the product of the mea 
Since the average curve depends upon 
these individual products, no infer 
ences can be made about the naturi 
of the individual functions from 
average curve without knowledge 0! 
these individual products. Further 
more, if we knew enough about th 
individual functions to ascertain thesi 
individual products, the averagt 
curve would be quite gratuitous. 
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“THE 


RESPONSE TO COLOR AND EGO FUNCTIONS”: 


A CRITIQUE IN THE LIGHT OF RECENT 
EXPERIMENTAL EVIDENCE 


J. D. KEEHN 
Institute of Psychiatry, Maudsley Hospital, London 


In a recent article in this Journal 
Fortier (5) set out a theory intended 
to establish a correspondence be- 
tween color and affect. Beginning 
from and integrating the theoretical 
views of Schachtel (12) and Rickers- 
Ovsiankina (11), Fortier went on to 
present a number of publications al- 
legedly supporting the view that re- 
sponse to color is indicative of emo- 
tionality. It is the purpose of this 
note to examine briefly the validity 
of the evidence brought forward in 
support of this theory. 

The pertinence of the views of 
Schachtel and Rickers-Ovsiankina 
need not concern us here. Suffice it to 
say that they attempt logically to 
equate color with affect. Instead, it is 
more important to examine the ex- 
perimental evidence purporting to 
throw light on the efficacy of this 
logic. 

Three main lines of experimental 
evidence are cited: (a) evidence from 
the Rorschach test, (b) evidence from 
the Mosaic Test, and (c) evidence 
from examination of finger paintings. 
, In view of the overwhelming ma- 
jority of studies quoted by Fortier 
making use of the Rorschach, this 
Paper is confined to an examination 
of these studies. This is not unrea- 
sonable if one remembers that finger 
paintings and the Mosaic Test are 
in no way standardized and therefore 
Pena be regarded as of sufficient 
theo ity for the adequate testing of a 
Coli As Fortier finds no correla- 

etween the results obtained 


from the Rorschach and those from 
the Mosaic Test, thus being forced 
into the construction of a subsidiary 
ad hoc hypothesis, this restriction 
should act in his favor. 

Examination of the evidence pre- 
sented shows that all the experiments 
and observations quoted make use of 
the principle of external validation. 
Thus, different groups of varying de- 
grees and modes of “emotionality” 
were used and the relationship be- 
tween “emotionality” and Rorschach 
color score indicated. However, in 
many cases, instead of finding the ex- 
pected relationship Fortier finds re- 
sults not in accordance with his 
views. Thus, when he finds that 
Rorschach investigations do not sup- 
port Burt's (2) view on the emo- 
tionality of delinquents, Fortier dis- 
cards this view for another which is 
based upon Rorschach findings, and so 
finds an impressive correspondence 
between the new view of the affec- 
tive state of the delinquent and his 
color score on the Rorschach. Simi- 
larly, when he finds that Rorschach 
findings do not fit in with the tradi- 
tional view, he discards the orthodox 
view of the emotional sterility of the 
schizophrenic in favor of Beck's (1) 
opinion that those patients do have 
emotional experiences. All this de- 
spite the fact that Beck’s opinion is 
largely derived from his experience 
with the Rorschach test. There are 
other instances of similar ad hoc 
reasoning, but to continue would be 
pointless. 
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Even with the genetic studies 
Fortier finds it difficult to prove his 
point and complains that little evi- 
dence is available. But were Fortier 
acquainted with the works of Lind- 
berg (9), Eysenck (4), and others he 
would find numerous papers quoted 
demonstrating the decrease of color 
reaction with increasing age in chil- 
dren, evidence which would enable 
him to accept the general view of the 
increase of emotional control with 
age. This evidence comes from ex- 
periments with tests of color-form 
attitude, a line of approach which 
Fortier seems unaware of, for there is 
no mention of it in his text, although 
one reference does appear in the 
bibliography (Oeser [10]). However, 
no studies using the external cri- 
terion of “emotionality” are of much 
use in attempting to find a personal- 
ity correlate of color reactivity, for 
emotionality itself is but a vague and 
ill-defined term. 

If one wishes to discover whether 
or not reaction to color in general 
does have some correlate in the field 
of personality, then some indirect 
approach must be used. One such 
approach would be that of internal 
validation, This involves administer- 
ing a number of tests involving color 
as a scorable item and correlating the 
resultant test scores. Unless signifi- 
cant correlations were obtained be- 

-tween such tests, color reactivity 
could not be regarded as having any 
personality correlates, for if a par- 
ticular personality attribute were 
associated with high reaction to color 
then this reaction would have to oc- 
cur on all tests involving color reac- 
tion as a scorable response. 

Examination of the data of Eysenck 
(4), Clarke (3), and Lindberg (9) (re- 
worked) shows an average correla- 
tion between color-form tests of the 
order of .2. This leads the first two 


J. D. KEEHN 


of these authors to conclude that, ex- 
cept in the eventuality of isolating a 
factor of color-form attitudes, reac- 
tions to color-form tests are relatively 
specific to each test. Lindberg reached 
the opposite conclusion through a 
false statistical analysis, but the re- 
analysis of his data as shown in 
Keehn's study (8) supports the 
opinions of Eysenck and Clarke. 

However, a factor analysis of the 
intercorrelation matrix obtained from 
a number of color-form tests given to 
a large group of subjects did result 
in the extraction of a color-form fac- 
tor (8). Evidence presented from the 
same analysis (7) showed that the 
conventional Rorschach color score 
had no saturation on this factor but, 
instead, had a loading on a factor of 
whole-part attitude.| Thus the re- 
sults from this study were interpreted 
as an indication that color was used 
as a determinant in the Rorschach 
only if the subject attempted to in- 
tegrate as much of the stimulus as 
possible into his response. This shows 
the traditional view of the role of 
color in the Rorschach test to be in- 
correct. 

Support for the need of some re- 
interpretation of the part played by 
color in this test comes from an 
analysis of the experiments carried 
out to determine the validity of the 
concept of color shock. This concept 
is also relevant to the color-emo- 
tionality hypothesis, being, in effect, 
an extension of it paralleling the con- 
striction of emotion with the suppres 
sion of color responses. In a revieW 
of a number of studies with color 
shock (6) hardly any evidence for 
the dependence of color shock signs 
upon color was found. 

Thus, and this is the crux of the 


1 This evidence, of course, was not available 
to Fortier during the preparation of his pape 
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matter, inasmuch as the Rorschach 
color responses are at best only in- 
directly affected by color, the use of 


color-emotionality hypothesis is in- 
valid. Hence the great majority of 
the evidence put forward by Fortier 


the results of Rorschach studies in support of his theory must be re- 
either to support or to refute the  garded as irrelevant. 
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AN APPRAISAL OF KEEHN'S CRITIQUE OF "THE RESPONSE TO 
COLOR AND EGO FUNCTIONS" 


ROBERT H. FORTIER! 
Northampton, Massachusetts 


There are essentially three main 
criticisms levied by Keehn: 

1. Criticisms based upon Keehn's 
recently completed work. It becomes 
difficult to formulate a reply to criti- 
‘isms derived from original work 
Which has not yet been published. 
: at can be said here is based only 
ps the brief excerpt which Keehn 
ncluded in his critique. Keehn 


1 
Ew at the VA Hospital, Northampton, 


states: “ .. . inasmuch as the Ror- 
schach color responses are at best 
only indirectly affected by color the 
use of Rorschach studies either to 
support or to refute the color- 
emotionality hypothesis is invalid.” 
The path which he follows from his 
studies to this conclusion is rather 
difficult to follow, particularly as it 
applies to the writer's theoretically 
oriented review. 

Keehn does make two direct state- 
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ments in his critique before arriving 
at the rather sweeping conclusion 
quoted above: (a) “This [referring to 
his color-form factor analysis] shows 
the traditional view of the role of 
color in the Rorschach test to be in- 
correct.” (b) "...hardly any evi- 
dence for the dependence of color 
shock signs upon color was found." 

In regard to the first statement, the 
writer is largely in agreement with 
Keehn that the traditional view of 
the role of color in the Rorschach test 
is in many respects incorrect. Such 
was the point and purport of his en- 
tire paper. However, an evaluation 
of this statement by Keehn could 
have been better made if Keehn had 
stated its rationale more specifically. 
In regard to Keehn's findings of color 
shock and its relevance to the role of 
color, it can be seen by a scrutiny of 
the writer's paper (2) that in no whit 
does the theory he presents depend 
upon any interpretation of color 
shock. 

If the two statements quoted here 

are the vehicles which carried Keehn 
to his final conclusion, it can be seen 
that his conclusion has no relevance 
to the writer's paper. 
Keehn, upon receipt of my ap- 
praisal of his critique, very kindly 
sent me a slightly revised copy of the 
critique, in which he stresses some- 
what more the interpretation of his 
color-form factor analysis. This re- 
vision, in turn, prompted me into 
dealing more directly with these in- 
terpretations. They are as follows: 

a. Reactions to color-form tests 
are relatively specific to each test. 

b. "Thus, the results from this 
study were interpreted as an indica- 
tion that color was used as a deter- 
minant in the Rorschach only if the 
subject attempted to integrate as 
much of the stimulus as possible into 
his response.” 
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In regard to a, the writer wishes to 
point to what Keehn has called this 
writer's "subsidiary ad hoc hypoth- 
esis" concerning the Mosaic Test, 
The writer, in that formulation— 
whatever it may later be called— 
stressed that the specific response to, 
or mode of usage of, color is de- 
pendent upon the inherent dynamics 
of the Mosaic Test—or the functions 
which the individual had to perform 
in executing the test. Color was still 
related to affect in the manner in 
which it was sketched in the theory. 
The fairly strong implication was 
that the specific responses elicited by 
one test might be different from re- 


sponses elicited by another test - 


simply because the operations satis- 
fying the requirements of the two 
tests might be different. 


In regard to b, if affects can be re- | 


garded as part of the totality of any 
situation, that individual who is re- 
garded as more capable of handling 
affects is the one who can best in- 


tegrate affects into the totality of the 


situation, and can control them. 

2. Failure to pursue the line of at- 
tack developed by Lindberg and others. 
The original bibliography compiled 
by the writer consisted of several 
hundred articles. Limitations of 
space prevented a meaningful dis- 
cussion of some of these studies. 
Other studies were ambiguous oF 
otherwise unclear. In an origi 
draft of his review paper, the writer 


included the particular study by - 


Lindberg (3) which Keehn cited. 


However, he found that a great many — 


of the interpretations and conclusions 
made by Lindberg were completely 
immersed in a system of psychologY 
which had not appeared to any great 
extent in the American literature. Pe 
felt that a fair treatment of Lind- 
berg's interpretations and conclusions 
could not be made without dealing @ 
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great length with the particular psy- 
chological system used by Lindberg. 
Because space was limited, such an 
elaboration was impossible, and Lind- 
berg's study was dropped. 

3. The utilization of ad hoc hy- 
potheses. It was the writer's intent 
“to present a theory . . . and second, 
to examine a number of studies in- 
volving color in an effort to sub- 
stantiate and clarify the theory" (2, 
p. 41). Although this theory has been 
in existence for a number of years, it 
is by no means in common usage. 
Following this presentation, the writer 
reviewed a number of studies to test 
the theory. In its strict sense, the 
term "test" would have implied in 
the situation under discussion a mere 
listing of studies with the words 
"yes" and “no” indicating whether 
the study supported the theory or 
refuted it. And following such a list- 
ing, one could perhaps have taken a 

vote" by counting the “yeses” and 
“noes.” The writer felt that much 
more could be gained by subjecting 
the data to a logical analysis, partic- 
ularly when interpretations and con- 
clusions from the data were con- 
tradictory to prevalent points of 
view? In other words, particularly in 
those cases where interpretations and 
conclusions from data were contrary 
to prevalent views, the writer made 


! It is the writer's opini i 
1 pinion that his adapta- 
tion of Schachtel's and Rickers-Ovsiankina's 
E is not among the prevalent points of 
; and he so stated in his paper. 


reinterpretations of the original data, 
stated the reinterpretations, and sub- 
sequently attempted to examine their 
implications. The reinterpretations 
and inferences derived therefrom 
thus become “ad hoc" in the sense 
that these formulations were not 
made until the studies were seen, and 
their conclusions suggested the need 
for reinterpretation. Benjamin has 
this to say concerning ad hoc con- 
structs or explanations: "Explana- 
tion consists in the transfer of prop- 
erties and relations from the explana- 
tory entity to that which is to be 
explained; in explanation one in- 
creases the content of the data by 
adding to them the features of the 
hypothesis. But if a construct has 
been obtained by direct derivation 
from the data, there is clearly no pos- 
sibility of turning about and explain- 
ing the data in terms of the construct. 
The construct does not contain that 
increment of novelty without which 
explanation is impossible. A pure 
construct used as an explanatory en- 
tity produces an ad hoc or verbal ex- 
planation" (1, p. 186). The writer 
drew his reinterpretations and in- 
ferences in many cases from areas 
which lay outside the specific data 
reported in the studies which he re- 
viewed. Thus, in the sense in which 
the term is used by Benjamin, the 
reinterpretations are not ad hoc. The 
value of Keehn's criticism would 
have been enhanced had he specified 
the sense in which he felt these rein- 
terpretations to be ad hoc. 
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CURRENT INTERPRETATION AND SIGNIFICANCE OF 
LLOYD MORGAN'S CANON 


EDWARD NEWBURY 
University of Kentucky 


In 1932 Nagge (14) showed that 
the meaning of Lloyd Morgan’s 
Canon had been greatly transformed 
by later psychologists so that, per- 
mitting a variety of interpretations, 
it was difficult to apply and needed 
clarification. In some instances more 
recently, as in the earlier paper of 
Adams (1), the Canon has been in- 
terpreted as a version of the law of 
parsimony (Boring, 2, pp. 474, 498, 
623, 629; Griffith, 4, p. 322; Harri- 
man, 5, p. 255; Hilgard, 6, p. 343; 
Munn, 12, p. 2; 13, p. 2; Skinner, 
17, p. 4; Warden, Jenkins, and 
Warner, 20, p. 26; Waters, 22, p. 24; 
Woodworth, 25, pp. 49 f). In 
several of these (2, pp. 563, 623; 5, 
12, 13, 25) and others (18, p. 250; 24, 
pp. 124, 747), the Canon has been 
explicitly interpreted as a doctrine of 
simplicity. It has been related to 
Occam's Razor (2, p. 498; 4, 5, 25), 
has had imputed to it an anti- 
mentalism (4, 5, 17), or what amounts 
to a behavioristic reductionism (17), 
but also a futile subjectivism (20, 
pp. 45-54). The Canon has been 
criticized on several of these bases. 
This variety of interpretations poses 
a problem in both accuracy and the 
historical continuity of thought. 
What did Morgan say? 

The Canon and its context. Lloyd 
Morgan was mainly concerned with 
the continuity of mental evolution. 
His Canon was a methodological rule 
for placing animals in the scale of de- 
velopment. The first statement 
(1894) was: "In no case may we in- 
terpret an action as the outcome of 
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the exercise of a higher psychical 
faculty, if it can be interpreted as tl 
outcome of the exercise of one whi 
stands lower in the psychologica 
scale" (7, p. 53). In a revised ed 
Morgan added to this a clarifying 
restatement: “In no case is an anin 
activity to be interpreted in terms 0 
higher psychological processes, if 
can be fairly interpreted in terms of 
processes which stand lower in 
scale of psychological evolution and 
development” (8, p. 59). In 1925 hi 
extended the range of the Canon | 
embrace his now explicit emergent 
evolution: “In no instance should 
interpret events in terms of conce 
appropriate to a higher level of 
emergence if they can adequately 
interpreted in terms of concepts à 
propriate to a lower level of e 
gence" (10, p. 61). And in his A 
can autobiography (1932) he spe 
of “the evolutionary canon that: 
should not interpret an earlier 
lower stage of mental development 


The original Canon was set i 
context of double-aspect moni 
Introspection was necessary for 
rect acquaintance with psychi 
processes. Double inductions W 
required for inferring mental proc 
esses in other organisms, either hu 
man or animal, in which “inducti 
reached through the objective stt 
of certain physical manifestation: 
have to be interpreted in terms of in: 
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ductions reached through the in- 
trospective study of mental proc- 
esses” (7, p. 47, cf. pp. 36-53). 

The context was also one of an 
evolutionary naturalism which recog- 
nized the appearance of novel func- 
tions in a process called "selective 
synthesis" (7, ch. 19). “Synthesis, 
with new properties at critical turn- 
ing-points, was the burden of my 
evolutionary contention,” Morgan 
later commented, changing the term 
to "emergence" (9, p. 302 ff.). 

While Morgan recognized that a 
given process might be more highly 
developed in one animal type than in 
another, he was not willing to accept 
a reductionism whereby “‘all forms of 
animal life from the amoeba upwards 
have all the faculties of man," or 
that "in the higher forms of life the 
introduction of the higher faculties 
has been effected by some means 
other than that of natural evolution" 
(7, p. 58). He argued that “any ani- 
mal may be at a stage where certain 
higher faculties have not yet been 
evolved from their lower precursors; 
and hence we are logically bound not 
to assume the existence of these 
higher faculties until good reasons 
shall have been shown for such 
existence" (p. 59). Thus the Canon, 
in Morgan's conception, was one of 
the devices displacing an uncritical 
assumption of identity between the 
human and animal mind. 
pirhe context of the Canon shows 
sm Morgan was not trying to ex- 
Plain the behavior of a single animal 
in a single experimental situation, 
is to determine what character- 
ean consciousness could be at- 
his la E to a given animal type. In 
t vrl clarification, “lest the range 

principle be misunderstood,” 

e an carefully adds "that the 
s n by no means excludes the in- 
Pretation of a particular activity 


in terms of the higher processes, if we 
already have independent evidence 
of the occurrence of these higher proc- 
esses in the animal under observa- 
tion" (8, p. 59). Currently Hilgard 
makes a similar point, but erroneously 
gives it as an inversion of the Canon. 
"As now inverted," he writes, "it 
might be paraphrased: ‘An organism 
capable of ideational problem-solving 
may also use ideas in learning situa- 
tions in which they would be theo- 
retically unnecessary” (6, p. 345). 

Antimentalism and reductionism. 
Despite its introspective basis, the 
Canon has been interpreted as elim- 
inating conscious processes through 
an antimentalism or by implying in 
it a reductionism. Harriman quotes 
Occam's Razor—"entities must not 
be multiplied beyond necessity"— 
and states that "Morgan accepted 
this view, indicating that anecdotes, 
attribution of human mental ac- 
tivities to animals, and projection of 
introspections have no place in ani- 
mal psychology” (5, p. 255). Griffith 
makes the Canon, interpreted on the 
basis of Occam's Razor, appear to 
emphasize physical factors and not 
to necessitate the inclusion of mental 
ones (4, p. 322). Skinner writes: 
“Darwin, insisting upon the con- 
tinuity of mind, attributed mental 
faculties to subhuman species. Lloyd 
Morgan, with his law of parsimony, 
dispensed with them in a reasonably 
successful attempt to account for 
characteristic animal behavior with- 
out them" (17, p. 4). And Munn is 
not clear (13, pp. 1 f.). 

Waters with some approval says 
that "rigid application of Morgan's 
Canon” is criticized by Gestalt psy- 
chologists on the ground of need for 
"qualitative descriptions that are not 
allowed on the basis of this canon" 
(22, pp. 24 f.). Yet the Canon was 
consistent with the concept of emer- 
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gent novelties and the configurational 
qualities such as depth perception in 
which "the out-thereness" results 
from sensations coalescing “into a 
synthesis which has a new and de- 
terminate character” (7, p. 352; cf. 
also ch. 1). The criticism seems to be 
of a misinterpretation rather than of 
a rigid application of the Canon. 

Waters (21) attacks an anti- 
anthropomorphism which he ascribed 
ambiguously either to the Canon or 
to later interpretations of it. What 
Waters in general seems to find 
necessary is a critical anthropo- 
morphism in the sense of introspec- 
tively oriented concepts. Since this 
was the actual basis for Morgan's 
Canon, the value of historical con- 
tinuity of thought is lost in Waters' 
treatment, all the more emphatically 
because he designates Tolman, Kre- 
chevsky, and Maier as anthropo- 
morphic in this sense (pp. 535 f.). 
Similarly, Morgan's current signifi- 
cance is lost when Hilgard follows a 
historical treatment by his own 
criticism of the requirement that 
explanations be only in palpable 
terms (6, pp. 343 f.). And the hu- 
manistic, textbook aims of Stagner 
and Karwoski are hardly realized by 
stating that Morgan "attacked 'an- 
thropomorphism' (interpreting ani- 
mal behavior in human terms) with 
his famous canon" (18, p. 250). The 
Canon is a device for'selecting in in- 
terpretation introspectively derived 
processes, not for completely elimi- 
nating them, which would be to 
eliminate use of the Canon (cf. 11, 
p. 261). 

Occam's Razor and parsimony. The 
antimentalistic contradiction of ap- 
plying Occam’s Razor through the 
Canon was noted above in Griffith 
and Harriman. In the sense of Harri- 
man’s literal translation (supra) of 
the classical Latin statement (19), 


Occam's Razor is applied when we 
adhere to a paucity of assumptions, 
whereas Morgan’s Canon refers to 
lower processes of development. The 
total number of scientific entities is 
not decreased by the Canon in so far 
as they are introspectively available, 
but the number of emergent func- 
tions attributed to an animal of a 
given type may be decreased, so that 
for this animal the net effect is in line 
with Occam's Razor, while the ad- 
monition in the Canon regards the 
lowness of the functions, not their 
paucity. 

As an application of the law of 
parsimony and of Occam's Razor, 
Boring argues that however useful 
the Canon may have been in coun- 
teracting extremes of Romanes' an- 
thropomorphism, times and problems 
have changed, and entities of neces- 
sity must now be increased (2, pp. 
474, 498, 563). Since the Canon al- 
lows higher processes when inde- 
pendent evidence makes their as- 
sumption necessary, one may rea- 
sonably ask, why not further de- 
crease probability of error by utiliz- 
ing knowledge of developmental se- 
quences? 

While Occam's Razor and the law 
of parsimony have no doubt some- 
times been identified in science, the 
latter especially is a loose term sub- 
ject to a variety of meanings, some 0 
which go as far back as Aristotle 
(2, pp. 498 f.; 3, p. 231; 14; 15, p. 472; 
16, pp. 114 f.; 19; 23, p. 155). Itis 
only confusing to say, without ade- 
quate qualification, that Morgans 
Canon is derived from either, and 0 
doubtful accuracy to identify it with 
them. The confusion can be com- 
pounded by the doctrine of simplicity: 

Simplicity of explanation. Wood- 
worth criticizes Morgan’s Canon as 4 
doctrine of simplicity, raising the 
question of what is simpler. Ani 


wehologists have usually regarded 
vement as simpler than percep- 
b he says, but “judging by our- 
es we should say that nothing is 
than seeing an object and that 


ition" (24, p. 124). 

h a criticism was anticipated 
Morgan himself (7, p. 54). He 
, first, that "surely the 
ity of an explanation is no 
criterion of its truth," and 


is not the one accepted by 
" (pp. 54 f.). Second, "the 
ity of the explanation of the 
omena of animal activity as the 
t of intellectual processes, can 
y be adopted on the assumption of 
lative complexity in the men- 
mature of the animal as agent" 
$5). This assumption had to be 
fied by induction. Thus Morgan 
nguished between the doctrine of 
city and his Canon, which 
es no reference to it. 

organ did suppose that in evolu- 
the earlier physical and mental 
esses were simpler (p. 8). He 
lated an evolutionary develop- 
t with an increasing complexity 
Organic structure, of correlated 
Hvities, and of mental and psychic 
n tions (p. 55). This made the 
blem of psychic level which logi- 
y justified the Canon. But in this 
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development appeared new prop- 
erties. Indeed, years later Morgan 
turned his 1925 version of the Canon 
on the radical behaviorist by saying 
that "we should not interpret the 
lower forms of trial and error as 
affording evidence of trial and error 
of the kind that characterises reflec- 
tive procedure" (10, p. 61). Thus a 
doctrine of higher and lower in the 
level of development can be consist- 
ent with the presumptive fact of 
evolutionary development from sim- 
plicity to complexity without the 
methodological rule being one of sim- 
plicity at all. Failure to recognize 
this possibility seems to explain a 
good deal of the misinterpretation of 
the Canon. 

Comment. Aside from their his- 
torical inaccuracies, many current 
misinterpretations of — Morgan's 
Canon have sui generis failed to take 
advantage of possible logical develop- 
ments. Without contending that 
Morgan's methodology represents 
the last word, one can recognize in 
it some of the essentials for inte- 
grating modern introspective and 
comparative psychology. Whether 
this gain through historical conti- 
nuity can be realized depends upon 
an accurate and significant interpre- 
tation of that methodology, including 
the Canon. 
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It is not easy to find the dates of 
death of prominent men because the 
who's-who compendia usually omit 
the sketch when the man dies and 
thus never publish the date. For 
this reason one of us published in 
1928 the dates of death of 248 promi- 
nent psychologists who died during 
the twenty-five year period 1903- 
1927 (1). Now, a quarter of a cen- 
tury later, we undertake for the same 
reason to present a similar list for 
the next twenty-five years, 1928- 
1952. How did we get the names and 
data? 

We searched for this period Psycho- 
logical Abstracts, the Directories of the 
American Psychological Association, 
American Men of Science, Science, 
and Archives de Psychologie, which 
was especially rich in European 
notices during Claparéde's incum- 
bency. We added, of course (and 
continued later to add), various 
prominent names which came inci- 
dentally to our attention, and we 
came out with a list of 809 deceased 
psychologists, one death every twelve 

ays for a quarter century! Since 
not all these persons were impor- 
tant enough to list in crowded jour- 
nal space, we had to select. 

x So we gave the list of 809 to L. M. 
poen H. S. Langfeld, Wayne 
inh, ]. G. Beebe-Center, E. B. 
oii and the senior author of 
aa paper, asking them to indicate 
Hs v^ names they did not recognize 
van, dies names they recognized 
Bm urther association, which 
: es „they recognized and knew 
omething about, as well as the 


names they recognized but thought 
too unimportant for inclusion in the 
final list and the names they recog- 
nized but thought deviated too far 
in the sociotropic, biotropic, or 
physicotropic direction to count as 
psychologists. Then we edited these 
results, making decisions. 

We omitted such prominent names 
as Ruth Benedict, Bergson, Franz 
Boas, C. B. Davenport, Hrdlicka, 
Korzybski, Malinowski, Raymond 
Pearl, and J. J. Thomson, persons 
who are or were important to psy- 
chologists but not, in our opinion, 
quite psychologists. We allowed 
ourselves, as our list shows, to deviate 
more freely in the direction of physi- 
ology and neurology, in accordance 
with psychology’s tradition in which 
physiologische Psychologie was more 
important than Valkerpsychologie. 
So we include Cajal, H. H. Donald- 
son, Ferrier, Flechsig, W. M. Wheeler, 
and some other biotropes. We let in 
sociotropes as remote as Havelock 
Ellis, and the important psycho- 
analysts, like Adler, Ferenczi, and 
Rank. Freud, of course. Nor could 
we quite bring ourselves to exclude 
Karl Pearson. 

Next we had regard to the votes 
of our jury of six. We accepted for 
the list every name, not already 
excluded as peripheral to psychology, 
that received six, five, or four votes 
as being known about in addition to 
being barely recognized. That gave 
us nearly all of our list. Finally we 
accepted 12 names with only three 
votes, mostly names where once 
famous psychologists were known to 
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the older members of the jury and 
not to the younger. The senior au- 
thor then translated his prejudices 
into authority and took in three 
persons with only two votes and 
Beatrice Edgell whose contribution 
to psychology was recognized by 
none of the jury other than himself. 
Nobody was included on bare recog- 
nition alone. Thus we obtained a 
list of 193 persons, to which we have 
added gratuitously entries for three 
important psychologists who died 
early in 1953: Kafka, Katz, and 
Marbe. 

The list is published for the pur- 
pose of recording exact dates of 
death, Instead of giving age at time 
of death we have included in all 
cases exact date of birth. It has not 
been easy to obtain these data. We 
have consulted over two dozen files, 
from the New York Times and Who 
Was Who in America to the published 
necrologies in the standard scientific 
journals. For the last score of entries 
we wrote letters to persons who 
might know about the deceased or 
have ready access to local records. 
Professor P. Fraisse of Paris and 
Professor H. v. Bracken of Braun- 
schweig have been particularly help- 
ful in obtaining some of the French 
and German items. 

Our geographical entry is meant 
for identification. It is not the place 
of death, which is sometimes an ob- 
scure locality to which an old man 
had retired. We have tried to give 
what seemed to us to be the place of 
last important professional activity. 

Here is the list. 


Ach, Narziss Kasper, Göttingen, b. 
7 June 1848, d. 25 July 1946. 

Adler, Alfred, Vienna and New York 
City, b. 7 Feb. 1870, d. 28 May 
1937. 

Adler, Herman Morris, Univer. Cali- 


fornia, b. 10 Oct. 1876, d. 6 Dec. 
1935. 

Angell, Frank, Stanford Univer., b. 8 
July 1857, d. 2 Nov. 1939. 

Angell, James Rowland, Yale Uni- 
ver., b. 8 May 1869, d. 4 Mar. 1949, 

Angier, Roswell Parker, Yale Univer., 
b. 21 Oct. 1874, d. 24 June 1946. 

Arps, George Frederick, Ohio State 
Univer., b. 23 Jan. 1874, d. 16 
Sept. 1939. 

Aveling, Francis, London, b. 25 Dec. 
1875, d. 6 Mar. 1941. 

Babcock, Harriet, New York City, 
b. 7 Jan. 1877, d. 17 Dec. 1952. 
Bagley, William Chandler, Columbia 
Univer., b. 15 Mar. 1874, d. 1 July 

1946. 

Baldwin, Bird Thomas, Univer. Iowa, 
b. 31 May 1875, d. 12 May 1928. 

Baldwin, James Mark, Paris, b. 12 
Jan. 1861, d. 8 Nov. 1934. 

Becher, Erich, Munich, b. 1 Sept. 
1882, d. 5 Jan. 1929. 

Bingham, Walter Van Dyke, Wash- 
ington, D. C., b. 29 Oct. 1880, d. 8 
July 1952. 

Bleuler, Paul Eugen, Zürich, b. 30 
Apr. 1857, d. 15 July 1939. 

Bolton, Thaddeus Lincoln, Temple 
Univer., b. 27 July 1865, d. 3 Jan. 
1948. 

Book, William Frederick, Indiana 
Univer., b. 10 June 1873, d. 22 
May 1940. 

Bourdon, Benjamin Bienaimé, Ren- 
nes, b. 5 Aug. 1860, d. 11 July 1943. 

Breese, Burtis Burr, Univer. Cincin- 
nati, b. 17 May 1867, d. 31 July 
1939, 

Brett, George Sidney, Univer. To- 
ronto, b. 5 Aug. 1879, d. 27 Oct. 
1944. 

Brigham, Carl Campbell, Princeton 
Univer., b. 4 May 1890, d. 24 Jan. 
1943. i 

Brill, Abraham Arden, Columbia 
Univer., b. 12 Oct. 1874, d. 2 Mar. 
1948. 


2 


En 1 


"Buchner, Edward Franklin, Johns 


Hopkins Univer., b. 3 Sept. 1868, 
d. 22 Aug. 1929. 

Burks, Barbara Stoddard, Columbia 
Univer., b. 22 Dec. 1902, d. 25 May 
1943. 

Burnham, William Henry, Clark 
Univer., b. 3 Dec. 1855, d. 25 June 
1941. 

Burrow, Trigant, Lifwynn Found., 
b. 7 Sept. 1875, d. 24 May 1950. 
Cajal, Santiago Ramén y, Madrid, 
b. 1 May 1852, d. 17 Oct. 1934. 


‘Calkins, Mary Whiton, Wellesley 


Coll., b. 30 Mar. 1863, d. 26 Feb. 
1930. 

Cannon, Walter Bradford, Harvard 
Univer., b. 19 Oct. 1871, d. 1 Oct. 
1945. 

Cason, Hulsey, Univer. Miami, b. 21 
Feb. 1893, d. 30 Apr. 1951. 

Cattell, James McKeen, New York 
City, b. 25 May 1860, d. 20 Jan. 
1944, 

Claparéde, Edouard, Geneva, b. 24 
Mar. 1873, d. 29 Sept. 1940. 

Coghill, George Ellett, Wistar Inst., 
b. 17 Mar. 1872, d. 23 July 1941. 

Conklin, Edmund Smith, Indiana 
Univer., b. 19 Apr. 1884, d. 6 Oct. 
1942, 

Coover, John Edgar, Stanford Uni- 
ver., b. 16 Mar. 1872, d. 19 Feb. 
1938. 

Dearborn, George Van Ness, New 
York City, b. 15 Aug. 1869, d. 12 
Dec. 1938. 
ecroly, Ovide Jean, Brussels, b. 23 
July 1871, d. 12 Sept. 1932. 
elabarre, Edmund Burke, Brown 
Univer., b. 25 Sept. 1863, d. 16 
Be 1945. 
elacroix, Henri, Paris, b. 2 i 
1873, d. 3 Dec. 1937. oM 
€ Sanctis, Sante, Rome, b. 7 Feb. 
1862, d. 20 Feb. 1935. 
^ Max, Berlin, b. 8 Feb. 1867, 

- 19 July 1947, 
*wey, John, Columbia Univer., b. 20 
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Oct. 1859, d. 1 June 1952. 

Dockeray, Floyd Carlton, Ohio State 
Univer., b. 15 May 1880, d. 15 
Jan. 1949. 

Dodge, Raymond, Yale Univer., b. 
20 Feb. 1871, d. 8 Apr. 1942. 

Donaldson, Henry Herbert, Wistar 
Inst., b. 12 May 1857, d. 23 Jan. 
1938. 

Downey, June Etta, Univer. Wyo- 
ming, b. 13 July 1875, d. 11 Oct. 
1932. 

Drever, James, Edinburgh, b. 8 Apr. 
1873, d. 11 Aug. 1950. 

Dumas, Georges, Paris, b. 6 Mar. 
1866, d. 13 Feb. 1946. 

Duncker, Karl, Swarthmore Coll., 
b. 2 Feb. 1903, d. 23 Feb. 1940 

Dunlap, Knight, Univer. California 
Los Angeles, b. 21 Nov. 1875, d. 14 
Aug. 1949. 

Dusser de Barenne, Johannes Gre- 
gorius, Yale Univer., b. 6 June 
1885, d. 9 June 1940. 

Edgell, Beatrice, London, b. 26 Oct. 
1871, d. 10 Aug. 1948. 

Ehrenfels, Christian von, Prague, b. 
20 June 1859, d. 8 Sept. 1932. 

Ellis, Henry Havelock, London, b. 2 
Feb. 1859, d. 8 July 1939. 

Farrand, Livingston, Cornell Univer., 
b. 14 June 1867, d. 8 Nov. 1939. 
Ferenczi, Sándor, Budapest, b. 7 

July 1873, d. 22 May 1933. 

Fernald, Grace Maxwell, Univer. 
California Los Angeles, b. 29 Nov. 
1879, d. 16 Jan. 1950. 

Fernald, Mabel Ruth, Cincinnati, 
Ohio, b. 7 May 1883, d. 9 Oct. 
1952. 

Ferree, Clarence Errol, Johns Hopkins 
Univer., b. 11 Mar. 1877, d. 26 
July 1942. 

Ferrier, David, London, b. 13 Jan. 
1843, d. 19 Mar. 1928. 

Flechsig, Paul Emil, Leipzig, b. 29 
June 1847, d. 22 July 1929. 

Fletcher, John Madison, Tulane Uni- 
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ver., b. 27 June 1873, d. 12 Dec. 
1944. 

Forel, August, Zürich, b. 1 Sept. 
1848, d. 27 July 1931. 

Franz, Shepherd Ivory, Univer. Cali- 
fornia Los Angeles, b. 27 May 
1874, d. 14 Oct. 1933. 

Freud, Sigmund, Vienna and London, 
b. 6 May 1856, d. 23 Sept. 1939. 
Frey, Max von, Würzburg, b. 16 

Nov. 1852, d. 25 Jan. 1932. 

Fröbes, Joseph, Rome, b. 26 Aug. 
1866, d. 24 Mar. 1947. 

Gamble, Eleanor Acheson McCul- 
loch, Wellesley Coll, b. 2 Mar. 
1868, d. 30 Aug. 1933. 

Garth, Thomas Russell, Univer. Den- 
ver, b. 24 Dec. 1872, d. 20 Apr. 
1939. 

Geissler, Ludwig Reinhold, Ran- 
dolph-Macon Women's Coll., b. 
22 Sept. 1879, d. 15 Dec. 1932. 

Gelb, Adhémar, Frankfurt a. M., b. 
18 Nov. 1887, d. 7 Aug. 1936. 

Gilliland, Adam Raymond, North- 
western Univer., b. 5 Oct. 1887, d. 
1 Dec. 1952. 

Goldscheider, Alfred, Berlin, b. 4 
Aug. 1858, d. 10 Apr. 1935. 

Gosset, William Sealy (“Student”), 
London, b. 13 June 1876, d. 16 
Oct. 1937. 

Groos, Karl, Tübingen, b. 10 Dec. 
1861, d. 3 Apr. 1946. 

Grünbaum, Anton Abram, Utrecht, 
b. 23 May 1885, d. 10 Jan. 1932. 
Haggerty, Melvin Everett, Univer. 
Minnesota, b. 17 Jan. 1875, d. 6 

Oct. 1937. 

Haines, Thomas Harvey, Montclair, 
N. J., b. 4 Nov. 1871, d. 2 Mar. 
1951. 

Hamilton, Gilbert Van Tassel, Santa 
Barbara, Calif., b. 15 Jan. 1877, d. 
16 Dec. 1943. 

Head, Henry, London, b. 4 Aug. 
1861, d. 8 Oct. 1940. 

Hecht, Selig, Columbia Univer., b. 8 
Feb. 1892, d. 18 Sept. 1947. 


Henmon, Vivian Allen Charles, L 
ver. Wisconsin, b. 27 Nov. 
d. 10 Jan. 1950. 

Heymans, Gerardus, Groningen, 
17 Apr. 1857, d. 18 Feb. 1930. — 

Hobhouse, Leonard Trelawney, L 
don, b. 8 Sept. 1864, d. 21 Ju 
1929. 4 

Hóffding, Harald, Copenhagen, 1 
11 Mar. 1843, d. 2 July 1931. - 

Hollingworth, Leta Stetter, Col 
bia Univer., b. 25 May 1886, d; 
Nov. 1939. 

Holt, Edwin Bissell, Princeton U 
ver., b. 21 Aug. 1873, d. 25 J 
1946. 

Holway, Alfred Harold, Eastm 
Kodak Co., b. 28 Feb. 1905, d 
June 1948. 

Hornbostel, Erich von, Berlin am 
Cambridge, Eng., b. 25 Feb. 187 
d. 13 June 1935. E 

Horney, Karen, New York City, ! 
16 Sept. 1885, d. 4 Dec. 1952. 

Hull, Clark Leonard, Yale Univer. 
b. 24 May 1884, d. 10 May 199 

Husserl, Edmund, Freiburg i. Br.y! 
8 Apr. 1859, d. 27 Apr. 1938. 22 

Isaacs, Susan Sutherland, London, | 
24 May 1885, d. 12 Oct. 1948. 

Jaensch, Erich Rudolf, Marburg, V 
26 Feb. 1883, d. 1 Feb. 1940. — 

Janet, Pierre, Paris, b. 30 May 185% 
d. 24 Feb. 1947. ; ff 

Jastrow, Joseph, Univer. Wisconst 
b. 30 Jan. 1863, d. 8 Jan. 1944. 

Jelliffe, Smith Ely, New York City 
b. 27 Oct. 1866, d. 25 Sept. 194 

Jenkins, John Gamewell, iv 
Maryland, b. 30 May 1901, d. 3 
Jan. 1948. a 

Jennings, Herbert Spencer, J 
Hopkins Univer., b. 8 Apr. 
d. 14 Apr. 1947. D 

Judd, Charles Hubbard, Univer. í 
cago, b. 20 Feb. 1873, d. 18 Ji 
1946. E 

Kafka, Gustav, Würzburg, b. 4 
July 1883, d. 12 Feb. 1953. — 
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Katz, David, Stockholm, b. 1 Oct. 
1884, d. 2 Feb. 1953. 

Kellogg, Chester Elijah, McGill Uni- 
ver., b. 11 Nov. 1888, d. 9 July 
1948. 

Kiesow, Federico, Turin, b. 28 Mar. 
1858, d. 9 Dec. 1940. 

Kirkpatrick, Edwin Asbury, Fitch- 
burg St. Teach. Coll., b. 29 Sept. 
1862, d. 4 Jan. 1937. 

Kirschmann, August, Leipzig, b. 21 
July 1860, d. 24 Oct. 1932. 

Klemm, Gustav Otto, Leipzig, b.8 
Mar. 1884, d. 5 Jan. 1939. 

Koffka, Kurt, Smith Coll., b. 18 Mar. 
1886, d. 22 Nov. 1941. 

Kries, Johannes von, Freiburg i. Br., 
b. 6 Oct. 1853, d. 30 Dec. 1928. 
Kuhlmann, Frederick, St. Paul, 
Minn., b. 20 Mar. 1876, d. 19 Apr. 

1941. 

Ladd-Franklin, Christine, Columbia 
Univer., b. 1 Dec. 1847, d. 5 Mar. 
1930. 

Le Bon, Gustave, Paris, b. 7 May 
1841. d. 14 Dec. 1931. 

Leuba, James Henry, Bryn Mawr 
Coll., b. 9 Apr. 1868, d. 8 Dec. 
1946. 

Lévy-Bruhl, Lucien, Paris, b. 10 
Apr. 1857, d. 12 Mar. 1939. 

Lewin, Kurt, Mass. Inst. Tech., b. 9 
Sept. 1890, d. 12 Feb. 1947. 

Lindley, Ernest Hiram, Univer. Kan- 
Sas, b. 2 Oct. 1869, d. 21 Aug. 1940. 

Lindworsky, Johannes, Prague, b. 21 

pi 1875, d. 9 Sept. 1939. 
ink, Henry Charles, Psych. Corp., 
b. 27 Aug. 1889, d. 9 Jan. 1952. 
ipmann, Otto, Berlin, b. 6 Mar. 

EM, d. 7 Oct. 1933. 

Ipps, Gottlob Friedrich, Zürich, b. 
6 Aug. 1865, d. 9 Mar. 1931. 

Bn. John Thomson, Cam- 
ridge, Eng., b. 18 Jan. 1886, d. 1 

ub 1947, 

Tol: Robert, New York 
ru: b. 12 June 1866, d. 1 Nov. 


McDougall, William, Duke Univer., 
b. 22 June 1871, d. 28 Nov. 1938. 

McGeoch, John Alexander, Univer. 
Iowa, b. 9 Oct. 1897, d. 3 Mar. 
1942. 

Marbe, Karl, Würzburg, b. 31 Aug. 
1869, d. 2 Jan. 1953. 

Marston, William Moulton, New 
York City, b. 9 May 1893, d. 2 
May 1947. 

Martin, Lillian Jane, San Francisco, 
Calif., b. 7 July 1851, d. 26 Mar. 
1943. 

Maxfield, Francis Norton, Ohio State 
Univer., b. 29 Aug. 1877, d. 10 
Nov. 1945. 

Mead, George Herbert, Univer. Chi- 
cago, b. 27 Feb. 1863, d. 26 Apr. 
1931. 

Meyer, Adolf, Johns Hopkins Uni- 
ver., b. 13 Sept. 1866, d. 17 Mar. 
1950. 

Miner, James Burt, Univer. Ken- 
tucky, b. 6 Oct. 1873, d. 24 Mar. 
1943. 

Monakow, Constantin von, Zürich, 
b. 4 Nov. 1853, d. 19 Oct. 1930. 
Morgan, Conwy Lloyd, Bristol, b. 6 

Feb. 1852, d. 6 Mar. 1936. 

Morgan, John Jacob Brooke, North- 
western Univer., b. 23 Aug. 1888, 
d. 16 Aug. 1945. 

Müller, Georg Elias, Góttingen, b. 20 
July 1850, d. 23 Dec. 1934. 

Myers, Charles Samuel, London, b. 
13 Mar. 1873, d. 12 Oct. 1946. 

Patrick, George Thomas White, Uni- 
ver. Iowa, b. 19 Aug. 1857, d. 21 
May 1949. 

Pavlov, Ivan Petrovitch, Leningrad, 
b. 26 Sept. 1849, d. 27 Feb. 1936. 
Pearson, Karl, London, b. 27 Mar. 

1857, d. 24 Feb. 1936. 

Peterson, Joseph, Peabody Coll., b. 
8 Sept. 1878, d. 20 Sept. 1935. 
Pintner, Rudolf, Columbia Univer., 

b. 16 Nov. 1884, d. 7 Nov. 1942. 

Poppelreuter, Walther, Bonn, b. 6 

Oct. 1886, d. 11 June 1939. 
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Prince, Morton, Boston, Mass., b. 21 
Dec. 1854, d. 31 Aug. 1929. 

Pyle, William Henry, Wayne Univer, 
b. 27 Feb. 1875, d. 3 Mar. 1946. 
Rank, Otto, New York City, b. 22 

Apr. 1884, d. 31 Oct. 1939. 

Ranschburg, Paul, Budapest, b. 3 
Jan. 1870, d. 18 Jan. 1945. 

Richet, Charles, Paris, b. 26 Aug. 
1850, d. 3 Dec. 1935. 

Rignano, Eugenio, Milan, b. 31 May 
1870, d. 9 Feb. 1930. 

Robinson, Edward Stevens, Yale 
Univer., b. 18 Apr. 1893, d. 27 Feb. 
1937. 

Robinson, Florence Richardson, New 
Haven, Conn., b. 3 July 1885, d. 3 
Dec. 1936. 

Rosanoff, Aaron Joshua, Sacramento, 
Calif., b. 26 June 1878, d. 7 
Jan. 1943. 

Rubin, Edgar John, Copenhagen, b. 
6 Sept. 1886, d. 3 May 1951. 

Ruch, Giles Murrel, Washington, 
D. C.,b.7 July1892,d.15 Nov. 1943. 

Sachs, Hanns, Boston, Mass., b. 10 
Jan. 1881, d. 10 Jan. 1947. 

Saudek, Robert, London, b. 21 Apr. 
1881, d. 15 Apr. 1935. 

Schiller, Paul Harkai, Yerkes Lab. 
Primate Biology, b. 4 Nov. 1908, 
d. 1 May 1949. 

Schumann, Friedrich, Frankfurt 
a.M., b. 16 June 1863, d. 10 Mar. 
1940. 

Seashore, Carl Emil, Univer. Iowa, b. 
28 Jan. 1866, d. 16 Oct. 1949, 

Seashore, Robert Holmes, North- 
western Univer., b. 14 June 1902, 
d. 27 Aug. 1951. 

Shand, Alexander Faulkner, London, 
b. 20 May 1858, d. 7 Jan. 1936. 
Sharpey-Schafer, Edward Albert, 
Edinburgh, b. 2 June 1850, d. 29 

Mar. 1935. 

Shepherd, William Thomas, Colum- 
bia Univer. Sch., Washington, 
D. C., b. 4 Feb. 1867, d. 2 Feb. 
1945. 


Sherrington, Charles Scott, Oxford, 
b. 27 Nov. 1857, d. 4 Mar. 1952, 
Shirley, Mary Margaret, Indiana 
Univer., b. 7 Feb. 1899, d. 11 June 
1946. 

Small, Willard Stanton, Univer, 
Maryland, b. 24 Aug. 1870, d. 31 
Jan. 1943. 

Smith, Stevenson, Univer. Washing- 
ton, b. 29 Apr. 1883, d. 27 Nov, 
1950. 

Snoddy, George Samuel, Indiana. 
Univer., b. 21 Apr. 1882, d. 29 
June 1947. 

Spearman, Charles Edward, London, 
b. 10 Sept. 1863, d. 17 Sept. 19 

Starbuck, Edwin Diller, Univer. 
Southern California, b. 20 Feb. 
1866, d. 18 Nov. 1947. : 

Stern, William, Duke Univer., b. 
Apr. 1871, d. 27 Mar. 1938. 

Stetson, Raymond Herbert, Oberlin 
Coll, b. 1 Mar. 1872, d. 4 De 
1950. 

Stoelting, Christian H., Chicago, I 
b. 18 July 1864, d. 18 Mar. 1943. 
Stout, George Frederick, St. An 
drews, b. 6 Jan. 1860, d. 18 Aug. 

1944. 

Stumpf, Carl, Berlin, b. 21 Apr. 1848 
d. 25 Dec. 1936. a 

Sutherland, Arthur Howard, Louist 
ana Coll., b. 19 Nov. 1878, d. 18 
May 1951. ) 

Swift, Edgar James, k 

Univer., b. 24 July 1860, d. 3 
Aug. 1932. 

Tait, William Dunlop, é 

Univer., b. 20 Nov. 1879, d. 10 
May 1945. d 

Thorndike, Edward Lee, Columbia 
Univer., b. 31 Aug. 1874, d. 9 Aug- 
1949. 

Triplett, Norman, Kansas St. Teach 
Coll., Emporia, b. 1 Oct. 1861, € 
16 Oct. 1934. i 

Troland, Leonard Thompson, Har 
vard Univer., b. 26 Apr. 1889, d. 2/ 
May 1932. 


PSYCHOLOGICAL NECROLOGY (1928-1952) 


Twitmyer, Edwin Burket, Univer. 
Pennsylvania, b. 14 Sept. 1873, d. 
3 Mar. 1943. 

Valentine, Willard Lee, Northwest- 
ern Univer., b. 2 Dec. 1904, d. 5 
Apr. 1947. 

Warren, Howard Crosby, Princeton 
Univer., b. 12 June 1867, d. 4 Jan. 
1934. 

Washburn, Margaret Floy, Vassar 
Coll., b. 25 July 1871, d. 29 Oct. 
1939. 

Weiss, Albert Paul, Ohio State Uni- 
ver., b. 15 Sept. 1879, d. 3 Apr. 
1931. 

Wembridge, Eleanor Harris Row- 
land, Los Angeles, Calif., b. 9 Dec. 
1883, d. 20 Feb. 1944. 

Wertheimer, Max, New School for 
Social Research, b. 15 Apr. 1880, 
d. 12 Oct. 1943. 

Wheeler, William Morton, Harvard 
Univer., b. 19 Mar. 1865, d. 19 
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Apr. 1937. 

Whipple, Guy Montrose, Clifton, 
Mass., b. 12 June 1876, d. 1 Aug. 
1941. 

White, William Alanson, St. Eliza- 
beth's Hosp., Washington, D. C., b. 
24 Jan. 1870, d. 7 Mar. 1937. 

Willoughby, Raymond Royce, Provi- 
dence, R. I., b. 20 Apr. 1896, d. 3 
Oct. 1944. 

Wirth, Wilhelm, Leipzig, b. 26 July 
1876, d. 13 July 1952. 

Woolley, Helen Bradford Thompson, 
Columbia Univer., b. 6 Nov. 1874, 
d. 24 Dec. 1947. 

Yoakum, Clarence Stone, Univer. 
Michigan, b. 11 Jan. 1879, d. 20 
Nov. 1945. 

Yule, George Udny, Cambridge, 
Eng., b. 18 Feb. 1871, d. 26 June 
1951. 

Zwaardemaker, Hendrik, Utrecht, b. 
10 May 1857, d. 19 Sept. 1930. 
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The temerity of the textbook 
writer who believes that he has pro- 
duced the answer to general psy- 
chology’s prayer is exceeded only by 
the audacity of the reviewer who 
presumes to pass judgment on the 
belief. More rash than wise is the in- 
dividual who would go still further 
and attempt an evaluative compari- 

1 This is the second special review dealing 
with the evaluation of books in a given area of 
psychology (cf. Editorial Note, Psychol. Bull., 
1953, 50, 149). 
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son of several current texts. It may 


be fun to predict consumer reaction, | 
instructor to 


or to dare some hardy 
edge himself out on 
textual limb. But a more helpful 
function of the review is to present 
the new books in broad context, 
leaving the responsibility for evalu- 
ation where it belongs—with each 
individual teacher. In view of the 
theoretical differences that charac- 
terize psychology, and the diversities 
in philosophy and practice typifying 
the teaching profession, it would be 
folly to expect that any teaching aid 
could be all things to all pedagogues. 

If there is any discernible trend 
running through all higher education 
today, it is a healthy compulsion to 
self-examination. It is interesting to 
speculate that its temporal proximity 
to World War II and continuing in- 
ternational crisis is more than coinci- 
dental. Manpower problems are 
more obviously critical, the success 
of selection and training methods in 
the armed services suggests that par- 
allel problems in the academic field 
might yield to equivalent experimen: 
tal analysis, and perhaps there is 1m- 
patience with a traditional scholar- 
ship that has failed to open the road 
to Utopia. Whatever the explana- 
tion may be, the consequence J 
clear. Spurred on by Presidential 
commissions and educational founda- 
tions, every college institutes some 
new curricular reform, or at least rè- 
names an old one; private and public 
soul-searching is the order of the day: 

In view of psychology's subject 


an untested 


| 
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matter and demonstrated skills, it 
would be anomalous indeed if we 
were unwilling to confront the situa- 
tion in our own bailiwick. While 
much of our extra-laboratory energy 
has been applied to clinical and in- 
dustrial problems, curriculum ra- 
tionale and teaching procedures have 
been receiving increasing amounts 
ofattention. It is becoming gradually 
accepted that graduate departments 
may with profit organize seminars to 
guide their students in the develop- 
ment of instructional proficiency. 
The Division on the Teaching of 
Psychology is growing not only in 
membership but in productive ac- 
tivity. Most important, the recog- 
nition is spreading that criteria are 
fully as elusive in the classroom as in 
pilot training, that sampling biases 
are not automatically eliminated by 
college matriculation, and that peda- 
gogical preconceptions, like experi- 
mental hypotheses, exist to be chal- 
lenged. 


What's Wrong with General Psychol- 
ogy? 

The focus of discussion has been 
the introductory psychology course. 
It might be argued that the dispro- 
Portionate importance of this course 
makes it worthy of greatest effort, for 
it serves the greatest number of stu- 
dents and it is the primary medium 
of recruitment. Further, it probably 
encompasses the problems of all but 
the most technical courses, both 
undergraduate and graduate. And, 
if further excuse is needed, the begin- 
Wie is the logical place to begin. 
: at shall be our objectives in this 

ourse: to increase the student's ap- 
Preciation of the advantages and 
aos of scientific method, to 
pre rish the development of more 

E attitudes toward human 

avior, to enhance general under- 


standing of behavior dynamics by 
delineating basic psychological prin- 
ciples, to give a balanced view of 
what constitutes the scientific disci- 
pline and the professional practice 
called psychology, to help the stu- 
dent solve his emotional and voca- 
tional problems? Should we lecture to 
our classes, or plan directed discus- 
sions, or just sit and exude warm 
acceptance? Is the major assigned 
reading to repeat the class activities 
or is it to cover the topics for which 
no class time is available? Is the num- 
ber of pages apportioned to the sev- 
eral topics to reflect the relative inter- 
est that a sample of unsophisticated 
students have expressed? Is the 
reading to furnish case material for 
group dissection? Is it to add the 
living flesh of research to the bare 
skeleton of topics orally presented? Is 
it to be entertaining and lively, or 
sober and pedestrian? 


A Guide: Buxton et al. 


A partial set of answers comes from 
the symposium on the improvement 
of undergraduate psychology instruc- 
tion, held at Cornell during the sum- 
mer of 1951. Much of the report 
deals with the introductory course. 
It is first assumed that the desires 
and expectations of the students 
should not dictate the nature of the 
course, although they may well. be 
exploited to facilitate the learning 
process. Problems of everyday inter- 
est may furnish the point of depar- 
ture, but the instructor must be sure 
to depart a perceptible distance. 
This is definitely not to be a personal 
adjustment course, and the intellec- 
tual goals must never become second- 
ary to the urge to conduct group 
therapy. The primary teaching 
emphasis in the ideal curriculum is 
to be upon knowledge and content 
(the problems of psychology, facts 
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and principles of psychology, psy- 
chology as science, the structure and 
functioning of science), which should 
then indirectly lead the student to- 
ward the objectives of rigorous habits 
of thought and modified attitudes to- 
ward behavior. 

The study group finds that the 
effectiveness of the usual introduc- 
tory text is impaired by the unfortu- 
nate combination of two divergent 
organizations, and proposes that 
either the developmental or the 
cross-sectional approach be explicitly 
adopted and rather strictly followed. 
In the sample outlines presented, 
less than the traditional emphasis is 
placed upon the topics of neural 
anatomy and physiology, the recep- 
tors, the effectors, tests, individual 
differences, and statistics. While no 
selection and arrangement of topics 
can remain definitive in a changing 
field, the adoption of some guiding 
organization is preferable to a list- 
less eclecticism born of inertia and 
ancestor worship. 


Science and. Practical Living: Hilgard 


Hilgard's basic orientation involves 
two major objectives, which at first 
glance may seem to be incompatible. 
He wishes the text to give a fair view 
of psychology as a whole, and to in- 
troduce the student to “those topics 
which are the centers of excitement 
among professional psychologists to- 
day" (p. ix). At the same time, he 
recognizes as valid the desires of the 
elementary student to receive help 
in increasing self-understanding. To 
reconcile these aims is not impossible, 
he feels. After all, more psycholo- 
gists are actively concerned with 
industrial and clinical matters than 
with retinal interaction. And so it is 
fair enough to devote nearly 20 per 
cent of the book to a section entitled 
"Psychology Applied to Personal and 


Social Problems" (“Mental 
and  Readjustment Technique 
“Vocational Adjustment," ''Psyc 
ogy in Industry," “Public Op 
and Propaganda," ‘Problems of S 
cial Groups"), while omitting tl 
usual chapters on receptor struct 
and processes, the nervous s 
biological genetics, and stati 
Still more significant is the impre 
sion that Hilgard is striving to b 
“person-oriented” in the other 80 p 
cent, most obviously so in the cha 
ters on ''Adolescence and Adul 
hood," "Emotion and Motivation; 
"Social Motives,” ‘Conflict and 
Frustration,” and “Individual Modes 
of Adjustment." He is not simpl 
describing a series of representati 
experiments from the relevant sec 
tions of Psychological Abstracts. © 
is instead analyzing some majo 
problems of behavior in terms unde 
standable to the student, and f 
counting what he has come to think 
about them after all these years as. 
psychologist; experiments are inter- 
polated chiefly to indicate that his 
thinking is not entirely autistic. 

If relegation of the laboratory 
the relatively minor position Y 
please the student, it will just as 
surely disappoint some instructi 
This is first of all a course in science, 
chiefly experimental science. Is } 
possible to preserve the spirit of. 
search in a 16-half-line abbreviatio 
of a 32-page experimental articl 
Can six or seven pages toward t 
end of the book, formally describir 
scientific procedure, adequately 
quaint the student with the fun 
mental problems and limitations 
experimental methodology and it 
culcate in him the desired habits 
rigorous thinking? Will not t 
essay form, the soft pedaling of co! 
troversial data, give a false impr 
sion of ease and certainty in $ 


SPECIAL REVIEW 85 


tific discovery? Is it enough merely 
to state the final polished results of 
investigations? What of the endless 
struggle with experimental controls 
and the slow refinement of quanti- 
tative measurement? Or can we as- 
sume that a competent instructor 
will be an indispensable half of the 
course? (It is almost paradoxical 
that of the four really new texts re- 
viewed, this is the one that least 
needs a constant interpreter and fel- 
low discussant, with its delightfully 
readable and yet mature style.) 


A Unifying Concept: Stagner and 
Karwoski 


It is often suggested that the best 
preparation for the Ph.D. comprehen- 
sive examination is to study an in- 
troductory text. Stagner and Kar- 
woski's Psychology is admirably 
suited for this use. They make no 
attempt to be easy, to skip over 
difficult concepts, to subordinate the 
standard subject matter of psychol- 
ogy to matters of immediate practi- 
cality, or to “write down.” While 
this may satisfy the graduate stu- 
dent's needs, it may by the same 
token discourage the sophomore. If 
the services of the professor are re- 
quired to add rigor to Hilgard they 
will be helpful here to assure under- 
standing and everyday interest. The 
one-semester course based on this 
text can be very solid but will surely 
not be easy. 

à A frequently voiced complaint is 
dm general psychology is a loose 
Tt inging together of disparate topics. 
^s is difficult for the instructor to 
ee for the student an integrated 
Dee unless some unifying con- 
pt is invoked. It is to homeostasis 
ees and Karwoski turn to 
E e foncion, Some degree of 
Bt a, ac ieved by this device. 
e tying together of topics is 


accomplished still more meaningfully 
in another fashion. There is laudable 
willingness to mention a concept out- 
side the chapter or the chapter sec- 
tion where it "belongs." Thus, the 
longish chapter labeled “Perception” 
involves considerations of person- 
ality, motivation, social factors; and 
perception recurs in the discussion of 
problem solving, of remembering, of 
thinking, of intelligence, of person- 
ality. Such contextual cross-refer- 
encing of course fosters continuing 
expansion of the meaning of each 
idea. Other writers, as well as teach- 
ers, would do well to extend this 
demonstration of the basic integrity 
of the behaving organism. 


Revisions: Guilford ; and Asher, Tiffin, 
and Knight 


These two books, based as they 
are on earlier publications, represent 
the least drastic shift from the es- 
tablished pattern of the past twenty 
or thirty years. They will perhaps 
fit most readily into the traditional 
course organization. 

In 1941 Guilford was reported a 
best seller. Unless your favorite ex- 
periment is among those pruned 
away, there is little reason for you to 
like this blue printing any less than 
the red printing. A dozen pages have 
been added on “What Psychologists 
Do,” there is a new 30-page section 
on heredity and development, and 
the rest of the material is presented 
in somewhat modified order. 

Asher, Tiffin, and Knight, in what 
is essentially its third edition, has 
shifted its center of gravity some- 
what toward the right—slightly far- 
ther removed from the strictly ap- 
plied and closer to psychology as 
science. Of the 14 chapters in this 
relatively short book three are new 

“Development of the Individual,” 
“Body Structures and Behavior,” 
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and “Motivation”), and others have 
been partially rewritten. 


A Cross-Discipline Approach: Cole 


It is not always easy to judge 
whether an author's second book in 
a field is different enough from the 
first to require additional shelf space. 
A simple criterion now suggests it- 
selí: if another publisher is involved, 
any similarity between the two is not 
only coincidental but probably ille- 
gal! Human Behavior represents a 
considerable departure from Cole’s 
earlier General Psychology (2) and 
from all other elementary psychology 
texts, in both emphasis and organiza- 
tion. 

The most startling difference is 
found in the concluding section, 200 
pages devoted to the “self-system.” 
After a chapter relating the hyp- 
notic state and the divided self, there 
are five chapters describing major 
aspects of psychoanalytic theory. 
The final chapter, “The Normal 
Personality,” is a 50-page discourse 
which, as Cole suggests, goes beyond 
psychology in answering the “So 
what?” of the entire course. The 
reader will look in vain for the usual 
discussion of types vs. traits, an out- 
line of personality tests, and the clas- 
sification of deviations. Instead he 
will find an original dissertation on 
the philosophical and practical mean- 
ing of “normality,” with an unusually 
varied selection of allusions to liter- 
ature, theoretical psychiatry, and 
theology. It will be a strong dose for 
the student to swallow as he comes 
to the end of his survey of psychol- 
ogy. The average instructor will 
have to be shaken loose from a great 
many scientific fixations if he is to 
regard the treatment as nourishing 
rather than purgative. 

The second distinguishing feature 
is the abundance of documentation 


throughout the book, drawing f 
and at length from experimental 
scriptions, philosophical  treati 
sociological and anthropological exe 
positions, and a wide variety of case 
reports. If Hilgard tends to use illus- 
trations as footnotes to his discussi 
and if Stagner and Karwoski incor 
porate their references more inti- 
mately into the textual materi 
Cole can be said in some chapters to 
be providing the interpretive conti- 
nuity for an extended anthology 
behavior stories. The average 
ginning student will find that a great 
deal of the material makes interesting” 
reading, but he will probably 
find himself in need of considera 
classroom guidance if he is to 
stract the pattern of principles usus. 
ally considered to constitute the core 
of general psychology. In a 

real sense, we have another difficult 
text. S 
Surprise Entry: Skinner 

Inclusion of this item in the re- 
view may be unexpected and even 
controversial. Admittedly it wou 
be impossible if the list were re 
stricted to books deliberately de 
signed for the general psychology 
course or to those deviating no more 
than one sigma from the current 
norm. But even if it is unacceptable 
for widespread classroom adoption, 
its consideration should force us to 
scrutinize a little more closely what- 
ever faith underlies our practice m 
the first course. 

Science and Human Behavior was 
prepared in connection with the Har- 
vard general education course in be- 
havior science. The unhappy condi- 
tion of the world is attributed to & 
failure to resolve, or even to face, a 
basic contradiction between opposed 
conceptions of man. “A scientific 
conception of human behavior dic- 
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tates one practice, a philosophy of 
personal freedom another" (p. 9). A 
wise choice presupposes understand- 
ing of the alternatives; it is Skinner's 
responsibility to clarify the first 
position, by providing a meaningful 
definition of science, by demonstrat- 
ing that human behavior is amenable 
to the manipulations of experimental 
science, and by considering the impli- 
cations for practical living that must 
inevitably emerge. His belief that 
the variables significant for the un- 
derstanding and control of behavior 
lie only in the immediate environ- 
ment and the behavioral history of 
the organism is illustrated in 150 
pages of relatively nontechnical and 
nonquantitative discussion of operant 
behavior. Where the laboratory 
demonstration involves pigeons—and 
it not infrequently does—logically 
parallel examples from common hu- 
man experience are proposed. For 
example, the cumulative effect of 
accidental contingencies in operant 
feeding reminds us of human super- 
stition, the unusual efficacy of cer- 
tain reinforcement schedules seems 
to be relevant to the optimal payoff 
rate of gambling devices, verbal re- 
sponses are modified under the influ- 
ence of discriminative stimuli, imita- 
ami birds have their counterpart in 
uman dancers, punishment tem- 
mun suppresses the rate but 
vm unchanged the rat's or child's 

number of responses. Through- 
out this section, the systematic posi- 
c of the writer is never obscure. 
d to the graduate student: these 
oo should be reprinted as 
vealed.") Almost  Painlesly Re- 
Eee premise that com- 
E Z avior is within the potential 
n of science, the principles of sim- 
ad m are then extended to self- 

ol to thought processes and 


inner events, and to the functionally 
unified system of responses sometimes 
called "the self." The social situation 
is brought within the same frame- 
work, beginning with the two-person 
relationship and relentlessly (per- 
force speculatively) driving on to the 
controlling agencies of government, 
religion, psychotherapy, economics, 
and education. Like Cole, Skinner 
recognizes that when the science of 
behavior is pushed to its ultimate, 
the problem of who is to control 
whom and for what ends runs 
squarely up against some of the most 
cherished conceptions of human life. 
He does not counsel retreat. 

How does this help us in introduc- 
tory psychology? How can we de- 
velop in our students a respect for 
psychology as a quantitative and 
controlled mode of investigation if we 
omit the details of experimental de- 
sign and procedure, if we fail to talk 
in terms of means and deviations? 
Can anything but a distorted view 
result from trying to survey modern 
experimental psychology from the 
confines of the Skinner box? What 
happens to all the time-honored ob- 
servations and principles that we 
have come to know as general psy- 
chology? 

Perhaps the answer is that these 
facts and generalizations are unim- 
portant and soon forgotten, that clear 
perspective can most economically 
be gained from a consistent vantage 
point, and that numbers and gadgets 
are not the essence of science. Surely 
the story of the science-behavior 
marriage in the typical text is by 
contrast to this exposition pale and 
unsubstantial, the attempt to relate 
psychology to other aspects of life 
pitifully tentative. For those who 
feel that this part of our job is the 
most important, a little more of the 
Skinnerian approach would seem to 


88 FRANK W. FINGER 


be in order. If the unsupplemented 
classroom use of Science and Human 
Behavior is precluded by limitations 
of curriculum and clientele, its as- 
similation by the serious teacher is 
most appropriate. 


Trends 


Does the curve pass through 
enough points to permit extrapola- 
tion? 

To summarize the contemporary 
trends in the general psychology 
course would require a very complex 
formula indeed, with about as many 
negative terms as positive. And yet 
some hints may emerge from a com- 
parison of the present set of texts 
with a somewhat earlier reference 
group, such as Woodworth and Mar- 
quis (6), Dashiell (4), Cruze (3), and 
Munn (5). 

Topics. The most obvious differ- 
ence is in the line-up of chapter and 
section headings. There seems to be 
a tendency, though not unopposed, to 
transfer the formal study of statisti- 
cal techniques and the organic bases 
of behavior to advanced courses or to 
other departments. Thus statistics 
covers a total of perhaps six pages in 
the six books; the nervous system as 
a separate topic appears only in the 
two revisions; genes have all but dis- 
appeared from the new books; Hil- 
gard and Cole find it possible to write 
general texts without chapters on the 
receptors. With the addition of de- 
velopment sections to the revisions, 
only Stagner and Karwoski deviate 
in this respect from the reference 
texts. And yet any listing of labels or 
counting of pages must be viewed 
with suspicion; for example, it is 
doubtful that any of the books with 
the development chapter qualify as 
pure representatives of the "develop- 
mental" orientation outlined by Bux- 


ton et al., and it would be incorrect to 
assume that any but Skinner would 
exclude neurophysiological variables 
from psychology. It is more nearly 
correct to say that a trend away from 
compartmentalization has blurred 
certain arbitrary distinctions and en- 
couraged the juxtaposition of items 
formerly separated in writing. So, 
Stagner and Karwoski interject a 
discussion of significant characteris- 
tics of the nervous system at any 
point where it may clarify other 
topics, and Cole calls his approach 
biosocial in spite of his chapter 
omissions. 

While convincing quantitative 
proof may be lacking, I have the im- 
pression that more emphasis is now 
being placed upon personality, and 
especially upon its social determi- 
nants. The self concept has come 
into prominence (Hilgard, Stagner 
and Karwoski, Cole, Skinner) and 
there is no question but that the in- 
sights of psychoanalysis are enjoying 
a much more general acceptance. 
This, together with Hilgard's and 
Guilford's chapters surveying the 
fields of psychological endeavor, may 
suggest that general psychology, like 
the APA, is becoming concerned with 
the profession as well as the science. 

The whimsical order of topics has 
traditionally been a source of variety 
to lighten the yearly chore of teach- 
ing introductory psychology. There 
is in the present sample good evi- 
dence that some measure of stability 
is being approached—at least, de- 
velopment and motivation come 
early, and personality late. 

Specific content and format. More 
deadly to the chronic instructor than 
a fixed list of chapters is the same 
diet of illustrative stories, experi- 
ments, and pictures regurgitated 
from one generation of writers to the 
next. The student may be unaware 
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of the difference, but the jaded 
teacher will be refreshed by some of 
the new dishes cooked up by the 
current authors. Hilgard has been 
the most resourceful in the search 
for new ingredients to satisfy old 
recipes, and Stagner and Karwoski's 
chapter on thinking has a very liberal 
sprinkling of original elements. 
Generosity in the use of pictorial 
material, as well as variation in size 
and arrangement of page, is at once 
an economic, an aesthetic, and a psy- 
chological problem. Some publishers 
must recognize that the principles of 
satiation and of diminishing returns 
are as pertinent as the principles of 
attention, It may be for this reason 
that the use of half-tones is quite 
limited in Cole and in Asher et al., 
and moderate in Stagner and Kar- 
woski, and Guilford. Hilgard, on the 
other hand, has what might be called 
a superabundance, and distraction 
begins to outweigh clarification. It is 
probably helpful to portray in action 
representatives of three extreme 
Somatotypes, and one well chosen 
picture may be more valuable than 
three hundred words analyzing depth 
Perception, but it is questionable that 
4 picture of ten giraffes or a ship un- 
der construction will contribute a 
alf-page worth to the student’s un- 
derstanding of gregariousness or de- 
Moralization. The very obvious 
trend of increasing prices might be 
decelerated without educational loss 
if the criteria of moderation and rele- 
vance were more prominent in the de- 
termination of form. 
b ur mention should be made 
fn E vete bibliography fur- 
iof ilgard. It contains up- 
s of 850 items, given in complete 
rm and keyed to the pages in the 
he — reference is made. 
e. 1s also a glossary of about 500 


Systematic bias. There have been 
very few elementary texts with the 
controversial flavor of Titchener and 
Watson, and our first five suggest no 
unique trend in this regard. Few 
teachers are blessed with systematic 
conscience so strict as to prohibit as- 
signment of any of the chapters (ex- 
cept perhaps Cole's critical expo- 
sition of Freudianism), although as 
usual there will be some choking 
over definitions, distress at internal 
inconsistencies, and puzzlement as 
to how experience is to be examined 
if the instrument of introspection is 
discarded. Skinner is of course not so 
universally inoffensive. With all its 
restrictive disadvantages, this at 
least can be said of such a persistent 
theoretical treatment—that the pa- 
tient knows he has been treated. 
When the student finishes Skinner (or 
vice versa), he will be aware that he 
has been up against something, 
whether good or bad. Too few college 
experiences can similarly be charac- 
terized. 

Teaching aids. It would seem logi- 
cal that the instructor is in the best 
position to decide how a textbook and 
other supplementary devices can op- 
timally be fitted into the course plan. 
However, as a guide for the inexperi- 
enced or preoccupied and as a source 
of new ideas, many writers furnish 
extratextual material. Guilford, and 
Asher et al., include self-test questions 
after each chapter. Separate manu- 
als for instructor and student ac- 
company both Hilgard, and Stagner 
and Karwoski. Their quality indi- 
cates that greater than usual effort 
was expended in their preparation. 
Incidentally, it is in the Stagner and 
Karwoski material that we find the 
only suggestion that laboratory work 
can be a useful part of the introduc- 


tion to psychology. 
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The Moral 


No great perspicacity is required 
to sense the restlessness that pervades 
instruction in general psychology. 
We can't quite agree on what we 
should be doing, or how. The prob- 
lem certainly isn't solved when in 
grammatical confusion we echo the 
old wheeze about teaching students 
instead of psychology. The report of 
the Cornell symposium does not sup- 
ply the final word (cf. 1). Nor should 
we expect any textbook writer to 
solve the riddle for us. It is not 
enough, for example, to adopt a text 
whose preface solemnly asserts that 
it is "student oriented" and which 
thereafter departs from the old mold 
only by addressing the reader as 
“you” instead of “one.” 

The answers must be found in the 
course itself, not in the auxiliary aids 
provided. There are a dozen texts 
that are still adequately up to date 
and that can be used with profit by 


the capable class and the thoughtful 
instructor. No one of them can as 
sume the responsibility for teaching. 
(To add to the cliché, you can't 
“teach a book," either.) Indeed, if a 
choice were forced as to which locus 
of student-orientation is the more 
critical, it should obviously be the 
day-by-day activity of the instruc- 
tor, with the printed supplement 
simply a sound and representative 
description of psychological science. 
There are many ways of teaching 
effectively; it is up to each instructor 
to generate some hypothesis as to 
what procedures and what adjuncts 
are most fruitful, and then test and 
revise and retest the hypothesis ad 
infinitum. The diversity of textbook 
offerings is most fortunate, if it has 
the effect of forcing the instructor 
into a questioning and experimental 
frame of mind. The crucial self- 
examination in higher education is 
after all at the level of the individual. 
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CLARK L. A behavior system: 
introduction to behavior theory 
mcerning the individual organism. 
Haven: Yale Univer. Press, 


—In the preface to this book, written 
ortly before his death, Professor 
ull places it as the second in a three- 
plan. The first volume, 
les of Behavior, appeared in 


iate the deductions of a natu- 
ce theory of behavior." The 
lent of the present work, as the sec- 
d volume, is "to show the applica- 


on of the simpler phenomena char- 
zing the behavior of single or- 
s." The plan called for a third 


ime principles to the deduction of 

: elementary phenomena of social 
or, i.e., of behavior manifested 
en the interacting objects are 
immalian organisms of the same 
cies.” Professor Hull foresaw 
lat he would probably not live to 
rite the third volume. In some 
se this second volume is his final 
lest to the science of psychology. 
“AS Such, it deserves sober review as 

one achievement of more 

20 years devoted to this type of 
i atic formulation. 

1 fairness to Professor Hull, it 
be remembered that this work 
Written during his years of failing 
» When he could work but a few 
a day, and when he could not 
„âS formerly, on an active and 
tinuing intellectual interchange 
his colleagues and students. 
the critic may detect incom- 
#nesses and contradictions which 
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perhaps ought not to be there, yet 
which would not have been there had 
Professor Hull been able to complete 
this with the same painstaking care 
and meticulous revision which char- 
acterized his earlier published papers 
and books. This book should be re- 
viewed in the spirit of one seeking to 
find out what can be learned from it 
by way of both method and content; 
it would be improper to concentrate 
upon errors of detail. Professor Hull 
did not believe that he had said the 
last word. His hope, expressed in the 
preface and in the conclusion, is that 
the method exposes its own errors, so 
that serious students can carry for- 
ward the work of making succeeding 
systems more precise. 

Those who know the history of theoretical 
psychology will understand that the present 
system is merely the most recent of a series. 
of miniature systems evolved by the present 
writer. The coming generation of scientists 
will, it is hoped, present other theoretical 
systems, each succeeding one of a progres- 
sively more precise and quantitative nature 
(p. 353). A 

I shall attempt to deal, first, with 
some of the more ingenious sub- 
stantive contributions found in the 
book, second, with its systematic 
and methodological significance, and, 
third, with a general estimate of Pro- 
fessor Hull’s place in the history of 
twentieth-century psychology. 


Substantive Contributions 


Those who were troubled by the 
limitations of topical coverage in 
Principles of Behavior will be pleased 
to find in the present volume many 
of the topics from the papers which 
appeared in the Psychological Review 
in the early 1930's, including be- 
havior in relation to objects in space, 
multidirectional maze learning, the 
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problem-solving assembly of behavior 
segments. 

In planning his deductive system, 
Hull made many early forays into 
more complex behavior, including the 
social behavior that he intended to 
treat in the final projected volume. 
But then he retraced his steps, so 
that, beginning with revised postu- 
lates, he could move from simple 
to more complex behavior. In Prin- 
ciples of Behavior he was concerned 
chiefly with the system of postulates, 
and so chose to deal quantitatively 
with behavior near to that formu- 
lated in the postulate system, with a 
minimum of what he called '*mul- 
tiple-link" deductions. The revised 
postulates were published in a little 
volume, Essentials of Behavior (1951), 
which chiefly brought the postulates 
in Principles of Behavior up to date 
as a background for the second 
volume. Now in A Behavior System 
he reviews, revises, and extends 
many of the deductions of more com- 
plex behavior which he had toyed 
with before the postulate system was 
well formulated. Hence, this new 
volume recovers some of the richness 
of the early papers which brought his 
work to the attention of psycholo- 
gists interested in many aspects of 
learning remote from classical and in- 
strumental conditioning. 

As a brief introduction to some of 
the substantive achievements, I 
would direct the reader to the follow- 
ing six deductions, with the corre- 
sponding empirical verification: 

a. Response alternation in trial-and- 
error learning. The reference experi- 
ment is one in which a rat can be re- 
inforced by moving either a hori- 
zontal or a vertical bar. When both 
bars have been reinforced, and then a 
shift in reinforcement is made from 
the more strongly and recently re- 
inforced bar to the other, there ensues 


a period of alternation before the - 


response is given more frequently to. 
the (now) regularly reinforced bar, 
The systematic relationships pre- 
dicted from the theory (Figure 11, 
p. 48) are borne out empirically with 
some qualifications (Figure 15, p. 52). 

b. Generalization gradients to 
stronger and weaker stimulus in- 
tensities. The theoretical differences 
in curvature of the gradients (Figure 
28, p. 82) is borne out empirically 
(Figure 29, p. 83). 

c. Latent learning. The shifts in 
reaction with shifts in incentive found 
in the early studies by Tolman and 
his collaborators, demonstrating 
latent learning, are now predicted. 
The prediction is shown in Figure 37, 
p. 144, the empirical material in 
Figure 38, p. 145, and Figure 39, 
p. 146. 

d. Reaction latency at points in a 
behavior chain. It is predicted that 
for a four-unit chain, reaction la- 
tencies will decrease for three units 
and increase for the final unit. The 
prediction is shown in Figure 42, p. 
163, the empirical verification 1n 
Figure 43, p. 164. 

e. Effect of motivation upon be- 
havior in a conflict between an adient 
and an abient object. The discussion 
and derivations are found on pP 
245-252, culminating in Theorems 88 
and 89, both of which are verified in 
the empirical outcomes of Table 32, 
p. 253. . 

f. Entrances into goalward-pointing 
alleys vs. those pointing away from 
goal, in a multidirectional maze. The 
prediction of greater frequency 9 
entrances into goalward-pointing 
blinds is summarized in Theorem 
112, p. 289, the empirical data 1n 
Table 35, p. 291. 

Professor Hull has made his ow? 
computation as to agreement 
tween prediction and empirical find- 


* 
p 


` 


—— 


ing. His figures are summarized in 
Table 1. 

Curiously enough, the result would 
be somewhat more convincing were 
the score poorer, rather than better. 
The success is not so convincing be- 
cause too many of the derivations are 
wery close to the data predicted, 
representing almost a working back- 
ward from what is known. 


TABLE 1 


EMPIRICAL AGREEMENT WITH THEORETICAL 
Propositions (From Hull, A behavior 
system, pp. 351-353) 


e — ————————— 


Propositions 4 “ai 
All theoretical propositions 
Evidence found bearing on 
validity o 5 
irect evidence only 30 17 
Subtotal 123 69 
Á Not covered by known rele- 
yant evidence 55 31 
Total 178 100 
Propositions covered by direct 
or indirect evidence 
— Substantially validated 106 86 
bly valid; consider- 
able uncertainty 14 11 
Definitely invalid 1 1 
Not classified 2 2 
Total 123 100 


Working backwards is not, in it- 
self, a defect in a theoretical system, 
Provided the system is coherent and 
internally consistent, and provided it 
also is fertile in predicting relation- 
ships not yet explored. As Hull 
Points out, the 55 predictions not 
Covered by known relevant evidence 
vill provide a good test, for these 
predictions are free of the charge of 
Working backward from the known. 

le range of coverage of the sys- 
tem is sufficient to test its systematic 
and deductive character, for the pre- 

Ictions cover many kinds of be- 


Avior not referred to directly in the 
postulates, " 
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Methodological and Theoretical Con- 
tributions 

Hull makes ingenious use of the 
hypothetico-deductive system, in 
which quantitative treatment moves 
back and forth between basic prin- 
ciples, or postulates, and empirical 
consequences, expressed in corol- 
laries; no one can seriously doubt the 
usefulness of such a model as that 
which he has furnished. The method 
is perhaps best elaborated as an ex- 
ercise in logic and mathematics in 
the earlier multiple-author volume 
entitled Mathematico-Deductive The- 
ory of Rote Learning (1940). The sim- 
ilar approach, but with a different 
postulate system, in Principles of Be- 
havior and now in A Behavior System 
is a model which will endure in the 
history of psychology. 

The basic condition of learning, 
according to Hull, is repeated rein- 
forcement which comes about through 
drive reduction. Some very impor- 
tant changes took place in Hull's 
quantitative theory of reinforcement 
between Principles of Behavior (1943) 
and A Behavior System (1952). While 
a complete discussion of these changes 
would require many pages, some of 
the main changes may easily be 
noted. 

Habit strength (sz) in the earlier 
book increased with reinforcement. 
The amount of increase per reinforce- 
ment depended upon (a) the magni- 
tude of need reduction per reinforce- 
ment, (b) the delay in reinforcement, 
and (c) the interval between the con- 
ditioned stimulus and the response 
to be conditioned to it. In the new 
book habit strength is solely a func- 
tion of the number of repetitions of 
a closely associated stimulus and 
response, provided this S-R conjunc- 
tion is reinforced. The amount of 
reinforcement, and the time relation- 
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ships, no longer affect sHz. Hence 
the basic learning condition becomes, 
in fact, much closer to that proposed 
in the familiar views of Guthrie and, 
more strikingly, of Tolman, because 
the basic learning process depends 
upon contiguous S-R relationships, 
relatively independent of motivation. 
The difference is preserved in that 
some minimum of reinforcement 
must occur. (This necessary mini- 
mum is nowhere made clear in the 
postulates or the text.) The main 
advantage of this change in relation 
of sHg to need reduction lies in the 
possibility of deducing latent learn- 
ing—the empirical basis for Tolman's 
views in the first place. 

What habit strength (sHg) loses, 
reaction potential (sEr) gains. The 
distinction between habit strength 


and reaction potential is essentially . 


the distinction between learning and 
performance, early emphasized by 
Lashley and Tolman. Habit strength 


is energized into actual behavior only 


by being converted to reaction po- 
tential through the effect of drive 
(D), stimulus-intensity dynamism 
(V), and incentive motivation (K). 
The delay in reinforcement (J) also 
affects the size of reaction potential 
which can lead to response at some 
link in the behavior chain. The 
drive (D) was familiar in the 1943 
version, but (V) is new and (K) plays 
a new role. The stimulus intensity 
dynamism (V) refers to the strength 
of the signalling (conditioned) stimu- 
lus. A stimulus of greater intensity 
will evoke a greater response, even 
though sZ remains constant. In- 
centive motivation (K) refers to the 
quantity of incentive used in earlier 
trials. It affects sEr, but not sHr. 
(There is no reference in the postu- 
lates to the size or nature of the 
perceived incentive in adient or abient 
situations, an important variable in 
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much motivated learning.) 
With increasing emphasis 
secondary reinforcement, the p 
differences between primary 
secondary reinforcement become im 
portant. In 1943 it is said that pr 
mary reinforcement is the result 
diminution of a need or drive, 
though elsewhere it is said that 
the drive stimulus Sp (not D) thati 
reduced. This latter statement i 
the one adopted in 1952, with 
addition of the possibility that 
duction in Sg (the stimulation 
sequence of the fractional anticij 
tory goal response) is reinforcing. ^ 
If stimulus termination (rath 
than need reduction) is the basis 
reinforcement, then secondary 
forcement is more readily accou 
for; it becomes, in 1952, a coro 
primary reinforcement rather 
separate postulate. (There 
fact two corollaries of primary 
forcement, one describing secon 
motivation as the substitution 
associated neutral stimulus as 
condition for a drive, the other 
scribing how a previously neut 
stimulus becomes, through ass 
tion, capable of acting as a reinfo 
agent) I find a certain glibnes 
the announcement of these 
laries. That something of the 
occurs in learning is clear enough, 
precisely what occurs is the foci 
controversy between those who favor 
cognitive theories over reinforce 
theories. Any blurring at this 
is therefore unfortunate, for it 
to claims of successes in predicti 
through a kind of tour de fore 
which any conditions which le 
learning must necessarily involve 
drive stimulus, and the reduction 
the drive stimulus by an ass 
reinforcing agent. All that needs 
be shown is a casual association 
some feature of the environ 
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with aroused drive in the past, and 
of some feature with reduced drive in 
the past. The actual linking of 
stimulus and response intermediaries 
is often not demonstrated, only their 
possibilities. Such possibilities exist, 
of course, in any sign-significate rela- 
tionship, so that sign learning ap- 
pears to be easily deduced from these 
principles. Because only some mini- 
mal amount of reinforcement (pri- 
mary or secondary) is necessary in 
developing habit strength, the case 
for a reinforcement theory is made 
even more plausible. This reinforce- 
ment theory requires scarcely any 
reinforcement, and even that little 
may be secondary Such a theory is 
not likely to be successfully refuted, 
even though it may be false. 

It is to Hull’s credit that he has 
searched about for formulations to 
take into account such demonstrated 
facts as came to his attention. He 
was willing to make quite radical 
modifications in his sytem, as here 
illustrated by the shift in properties 
between sHp and sEr. He struggled 
with the nature of primary and sec- 
ondary reinforcement. As Spence 
has pointed out, these modifications 
which make a great deal of difference 
in relation to certain controversies 
among learning theorists actually 
make rather little difference in the 
usefulness of a system of this kind, 
for tather simple modification in the 
equations commonly takes care of 
quite different theoretical interpreta- 
tions, 

There is a good deal more of 
straight empiricism in Hull's method 
m meets the eye. By comparison 

ith some of the newer mathematical 
and probabilistic models it is not very 
mathematical at all, relying heavily 
ES does upon curve-fitting. This is 

| some ways a virtue, for these em- 
Pirical relationships, if derived from 


careful experiment, will survive a 
good deal of tampering with basic 
assumptions about reinforcement. 


Hull's Place in Twentieth-Century 
Psychology 


Hull worked out in detail much 
that was implied in Watson's be- 
haviorism. Taking Pavlov's experi- 
ments as a source of his initial postu- 
lates, he started, or gave an impetus 
to, a new kind of system-building in 
psychology. The older psychological 
systems, represented in the "schools" 
of psychology, were largely program- 
matic, representing ways of defining 
the field, classifying phenomena, 
selecting units, and stating certain 
gross "laws" or principles by which 
the field could be ordered. While 
Hull's systematic bias was that of one 
of these schools (behaviorism), his 
method was simply that of scientific 
logic, in which there is an interplay 
between empirical data and rational 
ways of dealing with such data. The 
systematic formulations lead to pre- 
cise quantitative predictions, so that 
the system is, to some extent at least, 
self-correcting. This is the novelty 
that Hull contributed: a system at 
once fertile in its predictions, and pre- 
cise enough to be vulnerable to ex- 
perimental attack. Because he used a 
good deal of curve-fitting along the 
way, the theory hasa closer affiliation 
with functionalism than some of the 
newer probabilistic models which are 
more rational (less ad hoc) than his. 

There is an elegance in the effort 
that Hull made to develop a limited 
set of postulates to lead, by strict 
derivation, to theorems and corol- 
laries subject to experimental test. 
He held up Newton's mechanics as. 
the ideal, where a few laws of motion 
could account for the orbits of the 
planets, the tides, and the path of a 
projectile. He did not succeed, as 
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Newton did, but perhaps it is instruc- 
tive to remember that Newton, too, 
has been superseded. Hull has set a 
pattern richly influencing psychol- 
ogy today. He has succeeded in mak- 
ing a permanent place for himself in 
the history of psychology. 
Ernest R. HILGARD. 
Stanford University. 


VERNON, M. D. A further study of 
visual perception. New York: Cam- 
bridge Univer. Press, 1952. Pp. 
xi--278. $7.00. 


This book, although similar to the 
author's Visual Perception published 
in 1937, is not a revised edition but a 
new book. It is the nearest approxi- 
mation in print to a comprehensive 
review of experiments on perception 
as traditionally distinguished from 
sensation. That is to say, it omits 
the evidence from sensory physiol- 
ogy, optics, and ophthalmology, and 
concentrates on experiments concern- 
ing the perception of form and pat- 
tern, movement, the perceptual con- 
stancies, individual differences in 
perception, and the facts of attention, 
attitude, and motivation. There is 
little overlap, for instance, with 
Bartley's Vision, and it covers some 
material omitted from Boring's Sen- 
sation and. Perception in the History of 
Experimental Psychology. It might be 
said to be a book about nonstimulus 
delerminants of perception. The ex- 
perimental literature as so defined is 
voluminous, scattered, and contra- 
dictory. The facts as such are hard to 
systematize since they accumulated 
in the course of theoretical contro- 
versies. The author has surveyed a 
considerable part of this evidence by 
managing to cover over 500 studies 
in 260 pages. She has included not 
only the German but the French ex- 
perimenters. The most valuable fea- 
ture of the book may be that it will 


cause investigators to look up a great 
many references they might other- 
wise have neglected. 

The work of Michotte and his stu- 
dents on the frontier between psycho- 
physics and cognition is little under- 
stood in this country and Vernon's 
chapter on these experiments should 
prove especially useful. No compa- 
rable account exists in English. There 
are also good reviews of the percep- 
tion of movement, of the time error 
in psychophysical judgment, of color 
constancy, and of the problems allied 
to attention—a word which Vernon 
is not afraid to use despite the con- 
troversies over its meaning. 

Vernon's debt to Professor Bartlett 
of Cambridge is acknowledged at the 
outset, and her theoretical approach 
is allied to his. She appears to make 
approximately the following assump- 
tions: first, that the perceived field 
is radically unlike the stimulus field— 
the percept is “immensely modified" 
as compared with the sensory stimu- 
lation; second, that sensory impulses 
are the raw material for perception— 
percepts have to be constructed; 
third, that the individual constructs 
his perceptual world in accordance 
with his past experience and his per- 


-sonality—his attitude toward an ob- 


ject is an intrinsic part of his per- 
ception of it. The nature of this 
construction process is, then, the 
crucial question for theory. A good 
many writers in addition to Vernon 
are struggling with this question to- 
day, especially those who hope to 
find in the process a key to the per- 
sonality of the perceiver. So long aS 
one assumes that perception is base 
on sensation it is an inescapable 
question. Vernon shares with other 
contemporaries a certain disillusion 
with the explanation by Gestalt or- 
ganization, which seems less con- 
vincing to her than it did in 1937. 
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Nevertheless she, like others, uses 
the terminology of organization 
theory and has not explicitly rejected 
it. In the end she concludes that we 
do not yet understand the inner 
directing tendencies or schemata 
which shape our percepts. 

The failure of the evidence to 
cohere, however, may be the fault of 
the basic assumptions under which 
it was gathered. “Perhaps the con- 
struction process in perception is hard 
to discover because it does not exist. 
Perhaps the objective determination 
of perception and the subjective de- 
termination of perception cannot be 
mixed in a single theory of normal 
perception; they are incommensu- 
rable and what is required is a sepa- 
rate theory for each. Perhaps we 
must distinguish between Literal and 
schematic perception. The reviewer 
has proposed a special theory of the 
first type but Vernon disagrees, be- 
lieving that perception is always 
schematic, and has added an ap- 
pendix defending her emphasis on 
subjective determination. 

James J. GIBSON. 

Cornell University. 


SmwEIDDMAN,  EpwiN S., JOEL, 
WALTHER, & LITTLE, KENNETH B. 
Thematic test analysis. New York: 
Grune & Stratton, 1951. Pp. xi+ 
320. $8.75. 


Shneidman's primary purpose in 
Thematic Test Analysis was to com- 
pile à comprehensive manual of the 
ciiam scoring schemes now in use 
E analyzing TAT stories and to 
<a the application of such 
pos to a given clinical case. A 
t ondary purpose was to compare 
a relative merits of the TAT and 
E ARS test for the prediction of 
: us behavior. To these ends he 
nlsted the aid of fifteen TAT 
Methodologists, requesting them to 


score and interpret the TAT and 
MAPS stories of one John Doe. A 
separate chapter is devoted to each 
of the analyses. Behavioral data, 
psychiatric observations, and other 
test data—Wechsler-Bellevue, Ror- 
schach, Bender-Gestalt, Minnesota 
Multiphasic, and Draw-A-Person— 
are included for the reader but were 
not generally available to the in- 
dividual analysts. The book ends 
with a brief synthesis and summary 
by Shneidman and his collaborators, 
Joel and Little. The final summary 
chapter presents a résumé of the 
different approaches and their clas- 
sification into five categories: norma- 
tive (Cox and Sargent, Eron, Hart- 
mann, Klebanoff), intuitive (Bellak, 
Holt, Lasaga, Rotter and Jessor, 
Symonds), interpersonal (Arnold, 
Joel and Shapiro, White), and percep- 
tual (used by none as a primary tech- 
nique, but by several as a secondary 
technique). Tabulation and quanti- 
tative analysis are left for a second 
book. Shneidman concludes with 
two “rather striking impressions” 
(p. 307): (1) The various clinicians 
show “remarkable” agreement with 
each other (p. 307), and “they corre- 
late quite well with the behavioral 
data” (p. 303). (2) There is a com- 
mendable spirit of objective inquiry 
and personal humility prevailing 
among clinicians; they are willing “to 
take a position, right or wrong and 
to run the risk of public scrutiny” (p. 
307). Neither of these conclusions 
will satisfy the scientifically oriented 
reader. To correlate "quite well" is 
statistically meaningless, especially 
in the absence of quantification. “To 
run the risk of public scrutiny” is 
nothing new to the scientist. The 
book in its present form serves pri- 
marily as an illustration of how 
various clinicians would score the 
same set of thematic productions. 
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Much of the real work of synthesis 
and most of the evaluation is left to 
the reader. 

Granted that a great deal can be 
learned from a careful study of many 
different approaches to scoring the- 
matic productions, how are we to 
evaluate the contributions made by 
this particular compilation of mate- 
rials? For the student who wishes to 
become proficient in a given scoring 
system, not enough details are given 
by most of the analysts. Some score 
all of the stories, others only one or 
two. Moreover, many of the schemes 
depend so heavily on the intuition 
and background knowledge of the 
analyst that very little can be gained 
from the sketchy working notes 
which have been included. What we 
have here would be better termed a 
brief survey of scoring schemes, not a 
manual. 

A more basic problem for evalua- 
tion centers on Shneidman's own 
methodology. Here the toughminded 
experimentalist will find much to 
trouble him. He could fairly object 
to the lack of scientific rigor in the 
research plan and to the plan it- 
self. Just what, he could ask, is being 
examined here? Is the ultimate ob- 
ject of the research to test the 
relative usefulness of two different 
sets of projective materials, TAT and 
MAPS, for the blind diagnosis of one 
John Doe; or are the relative merits 
of the several different scoring 
schemes per se the major concern? 
Neither purpose is adequately served 
by the data assembled for analysis. 

The data, stories told by a 25-year- 
old unmarried male patient to eleven 
TAT pictures and seven MAPS 
settings, were not obtained under 
comparable conditions. The MAPS 
test was administered before insulin 
shock therapy was undertaken, the 
TAT two months later by a different 


examiner. Behavioral data on Jols 
Doe were «made available to ome 
scorer, at her request, not to 
others. What the effects of these 
differences are is an unknown worthy 
of experimental study. Yet the 
analysts were apparently unaware 
that such differences existed. 
Another serious shortcoming iB 
that individual scoring schemes were 
devised for the TAT, but not for the 
MAPS test. Many of the analysts 
recognize that their individual scheme 
is not entirely suited to the MAPS 
test. What profit is there, then, in 
forcing the stories into an ill-fitting 
mold? Again, most of the analysts. 
are plagued by the absence of ade 
quate norms for the TAT. For the: 
MAPS they have no norms at all. 
Some indicate that they are attempt- 
ing to establish norms for the TAT. 
Others are content merely to apply 
their own clinical knowledge to the 
interpretation of John Doe's stories. 
For communicating a scoring scheme 
to others, of course, this has serious 
limitations. 
The problem of quantifying the re- 
sults of the analyses is further com- 
plicated by the nature of the in- 
dividual analyses since the schemes 
vary from fairly precise scores b 
on checklists (e.g., Fine) and rating 
scales (e.g., Eron, Hartmann, Kle- 
banoff) to intuitive, impressionisti¢ 
interpretations (e.g., Bellack, Rotter 
and Jessor). Some of the analysts 
note regretfully that Shneidman did | 
not present the entire TAT or MA T 
test for their analysis, thereby in 
some cases eliminating pictures whi 
the individual analyst considered im- 
portant. r 
And what of the “supplementary 
tests” and the psychiatric data whi 
are included presumably for pre 
diction checks? Three of the tests 
were given before insulin therapy» 


— 
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two after it. Moreover, how are 
these tests really to be used? If they 
are to serve as validation data to 
evaluate the relative merits of the 
TAT and MAPS are we ready to 
accept the Rorschach or the Draw-A- 
Person, for example, as a validating 
instrument for the TAT or MAPS in 
the same way that we accept the 
Wechsler-Bellevue 1Q as a measure 
against which to test estimates of 
John Doe's intellectual capacities as 
revealed by the TAT? These are 
questions which clinicians cannot 
answer without very precise experi- 
mental study. 
On the other hand, if the psychiat- 
ric material is to be used as the 
validation data, here, too, Shneid- 
man's material is poor. Five different 
psychiatrically trained workers in- 
terviewed John Doe from time to 
time. Some were social workers, 
others were medical men. The psy- 
chotherapy undertaken with John 
Doe was apparently sporadic. He 
Seems to be one of those patients who 
wander in and out of hospitals and 
outpatient clinics, who are seen by 
many people for longer or shorter 
Periods of time, and for whom no sys- 
tematic treatment plan evolves. 
How seriously do these method- 
ological factors limit the usefulness 
of these materials? ^ Shneidman 
acknowledges that they do consti- 
tute "a limitation upon the possible 
Conclusions” (p. 4). ‘This investiga- 
gl he continues, "is a research 
ut not an experiment; it employed 
Systematic observation, not con- 
trolled experiment." This admission 
i ly excuses the lack of scientific 
a To be sure, clinical settings 
èn do set limits on a research 
sign. John Doe often cannot be 
pe Yet test material on 
n oe accumulates in the files be- 

se of the diagnostic demands of 


the hospital setting. The mere avail- 
ability of case material, however, 
does not in itselí justify its extensive 
manipulation for publication. 

One final approach to this material 
remains for consideration: What does 
it reveal about the present state of 
thematic tests? Taking these fifteen 
scoring techniques as representative 
of the manner in which the TAT is 
now being used, it is clear that much 
has been added to the original need- 
press scoring scheme of Murray and 
Sanford even by those who retain 
some of the basic structure (eg. 
Aron and Holt) Yet the funda- 
mental problem of the meaning of 
"projection" has not been resolved. 
What level of personality do these 
tests tap? How much effect does the 
stimulus (TAT picture, MAPS set- 
ting, inkblot) have on what is re- 
vealed? Are these tests primarily 
useful as diagnostic tools or are they 
better for revealing underlying dy- 
namics? The experts do not yet 
agree. Most of the analysts end up 
with the diagnosis of schizophrenia 
for John Doe. Some are of the 
opinion that the MAPS test shows a 
healthier pattern than does the TAT 
(Klebanoff, p. 131), others find even 
clearer evidence of pathology in the 
MAPS test (Arnold, p. 36; Korchin, 
p. 142). Some regard the MAPS test 
as better for diagnostic purposes, the 
TAT for the dynamics. Some stress 
the absence of guilt (White, p. 198), 
others find striking evidence of guilt 
(Bellak, p. 51). Since all of the 
analysts have had considerable ex- 
perience as clinicians, the differences 
between them stem, presumably, 
from differences in scoring rationale. 
To resolve these differences research 
is indeed needed. Given the present 
materials the reader could choose 
between these schemes largely in 
terms of personal preference for the 
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normative vs. the intuitive approach, 
for example, or for present ego- 
functioning vs. an approach which 
emphasizes reconstruction of the 
past. He could not choose in terms 
of which is the most valid technique. 
THELMA G. ALPER. 
Wellesley College. 


GELDARD, FRANK A. The human 
senses. New York: Wiley, 1953. 
Pp. x+365. $5.00. 


This is a general textbook of the 
senses—a textbook of human sensory 
psychophysiology in the complete 
sense of these words. Its 323 pages 
of actual text are divided into 15 
chapters. The first, 13 pages in 
length, is introductory. The next 
four, totaling 80 pages, are concerned 
with the sense of sight (The Visual 
Stimulus and the Eye; Basic Visual 
Phenomena; Color Vision and Color 
Blindness; and Visual Acuity, Con- 
trast, and Interaction). Hearing is 
discussed in three chapters (Sound 
Energy and the Ear, Auditory Phe- 
nomena, and Electrophysiology and 
Auditory Theory), occupying 64 
pages. The skin senses are allocated 
about as much space as vision and 
hearing, three chapters totaling 75 
pages (The Skin and Its Stimuli, 


Pressure and Pain, Temperature 
Sensitivity). Kinesthetic and Or- 
ganic Sensibilities, Labyrinthine 


Sensitivity, The Sense of Smell, and 
The Sense of Taste, are the headings 
for the last four chapters, which 
contain 16, 21, 25, and 29 pages, 
respectively. The text is followed by 
an excellent list of 330 references and 
an unusually detailed index for a 
book of this size. The latter is 25 
pages long and must contain well 
over 2,000 entries. 

Perhaps the most striking thing 
apparent in the statistical descrip- 
tion above is the relative amount of 
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space allocated to vision and hearing 
as compared with the other senses. 
Geldard's treatment of the former 
occupies a scant third of the entire 
book. Boring, Langfeld, and Weld 
(Foundations of Psychology), Munn 
(Psychology), and Stevens (Hand- 
book of Experimental Psychology), on 
the other hand, assign vision and 
hearing the greatest prominence; in 
all three cases these two senses are 
allocated almost exactly 75 per cent 
of the space devoted to sensory mate- 
rial generally. Geldard's book, 
though unbalanced, represents a rea- 
sonable distribution of emphasis since 
there are several good textbooks con- 
cerned exclusively with vision and 
hearing, and scarcely any with the 
other senses. 

As regards the general level of his 
writing, Geldard lies between Boring, 
Langfeld, and Weld, or Munn, on 
the one hand, and Stevens, on the 
other. The Human Senses contains 
considerably more advanced and de- 
tailed material on the senses than one 
finds in the typical good introductory 
text, but it is much less exhaustive 
and scholarly than Stevens’ work. 
Taking Boring et al. and Stevens as 
the two ends of a scale, I should say 
that Geldard lies closer to the former 
than the latter. In fact, the chances 
are that the experimental psycholo- 
gist who has kept reasonably abreast 
of recent developments will find 
Geldard's treatment a little too ele- 
mentary for his taste. : 

Geldard's exposition is lucid 
throughout and his position is gen- 
erally eclectic as regards sensory 
theories. I found no glaring errors to 
comment upon, or outrageous points 
of view to quarrel with. My chief 
complaint is rather that he com- 
mitted too many errors of omission. 
However, if you take this book for 
what it is—an intermediate-leVi 
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text with a sound, general perspec- 
tive—you will have to agree that it 
fills a definite gap in the psycholo- 
gist's library. 
A. CHAPANIS. 
Bell Telephone Laboratories 
Murray Hill, New Jersey. 


Nortuway, Mary L. A primer of 
sociometry. Toronto: Univer. of 
Toronto Press, 1952. Pp. vi+48. 
$2.25, cloth-bound; $1.50, paper- 
bound. 


A sociometric test measures the 
spontaneous choices of the members 
(or would-be members) of a group 
for and against associating with one 
another. Obviously, it must there- 
fore employ an appropriate criterion 
for the choosing. Thus, in work 
groups, this would be ''to be in the 
same work unit," ‘‘as neighboring 
workers on an assembly line"; in 
hospitalized groups, this could be 
to be on the same ward,” etc. Once 
the patterns of interpersonal response 
are uncovered, the meaning of the 
choices for the individual himself, 
for the functioning of the group, for 
the importance they have in the areas 
of Social and clinical psychology, 
Sociology, and anthropology, are the 
Concern of sociometry—namely, un- 
der what conditions of group life and 
the personal past life of individual 
members will such and such patterns 
E me intricacies of the pat- 
ihe an the complexities of their 
B oretical implications are enough 
E many a psychologist hesi- 

_to pursue sociometric research 
Bet It is to all workers, as 
HacN to beginners, in this position 
E orthway's A Primer of Sociom- 
š J is addressed, and is required 
eading, 
ate book orients the beginner to 
fs ure and role of sociometry and 

Vides the advanced worker with 
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perspective of what has been accom- 
plished and what lies ahead as un- 
touched territory for investigation. 
It achieves these ends in 48 pages of 
refreshing, clear, and compact writ- 
ing. The need for such a publication 
is widely felt and this one bids fair 
to become a classic of its kind in the 
midst of a literature which over the 
last 25 years contains about as many 
unreliable as dependable reports. 

Well-balanced and critical discus- 
sion is directed upon the merits and 
limitations of sociometric method, 
analysis, and evaluation of results. 
To this reviewer, the only unim- 
portant or irrelevant discussion for 
the purposes of the presentation ap- 
pears to be the brief use of compari- 
sons with other methods. Practically 
every major aspect of sociometric 
research (with the omission of nega- 
tive choice or rejection) is treated, 
followed by specific references to the 
most authoritative works bearing 
upon that aspect. Many varieties of 
Canadian, French, and American 
work are described in context under 
the particular aspect they shed light 
upon. The presentation is organized 
in relation to theory, practice, and 
application in research design. Prob- 
lems are outlined which raise our 
sights on research into what is growth 
and what is mental health. 

Last, not least, the author gives 
profoundly suggestive treatment to 
elements which appear likely to re- 
main universally important to choice 
reactions between people. Study 
directed toward them may be fruitful 
for many generations. Hence, A 
Primer of Sociometry can be seen as a 
practically useful and theoretically 
significant research guide to students 
of social, emotional, and group proc- 
esses. 

HELEN HALL JENNINGS. 

Brooklyn College. 
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WEINLAND, JAMES D. AND GROSS, 
MARGARET V. Personnel inter- 
viewing. New York: Ronald, 1952. 
Pp. vii+416. $6.00. 

Personnel Interviewing by Wein- 
land and Gross is not likely to receive 
much attention from psychologists. 
The authors have not approached 
their goal of having the book “... 
serve as a guide for all those who are 
concerned with the selection of per- 
sonnel and the maintenance of sound 
personnel relations” (p. iii). This 
text seems to be aimed at individuals 
in the personnel field who have had 
no training in psychology and have 
no intention of getting such training. 

The authors have taken upon 
themselves all-inclusive and ambi- 
tious purposes: “In recent years, 
scientific research has made available 
to supervisors of interviewers and 
personnel managers many new tech- 
niques and applications of personnel 
interviewing. Some of these, such as 
the non-directive, group, board and 
stress interviews and the Chappel 
[sic]-Chronograph have been used 
successfully in industrial personnel 
work. A major purpose of this book 
is to explain these practical and valu- 
able developments within a compre- 
hensive discussion of interviewing 
methods, and to show how they can 
be employed to advantage in firms of 
all sizes” (p. iii). The extent to which 
the authors achieve their purposes is 
illustrated by the fact that the entire 
space devoted to a discussion of the 
Chapple Chronograph consists of two 
short paragraphs, and one can hardly 
hope to learn much about the Chap- 
ple Interaction Chronograph from 
this description (p. 113). 

Some statements in the book cause 
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the authors state: 


one to wonder upon which gen 
psychological principles the b 
claims to be based: e.g., “Certs 
wrinkles in older people tell of 
peated grimaces; others of 
smiles. The interviewer who is int 
ested in people and takes care 
study them can often tell a good de 
about a person just by looking at hit 
(p. 16); and “Lavish jewelry 
gests an ego-centered personality i 
(p. 272). 

The authors have attempte 
this one short volume to deal w 
individual differences, personality « 
namics, motivation, directive a 
nondirective interviewing, s 
types of interviews, cor 
The shortcomings of the: 
of the various topics is to be expec 
in the light of their ambitious | 
tempt. The result will probably e 
the uninformed reader in a state! 
confusion. For example, on page 2 
“A correlation 
‘0.87 40.12’ would indicate that 
particular experiment a coi 
of 0.87 was found but that the 
correlation might not be found 
same tests, or measures, were used 
another case. Correlations mus 
ways be taken with a 'grain of sa 
the size of the probable error showi 
how much ‘salt’ to use." These sta 
ments must leave the reader still m 
knowing how much "'salt" is to be 
plied to the correlation coefficie 
“0.87 +0.12.” 

A book on personnel intervie 
needed in the industrial field. 
doubted by this reviewer that ti 
contribution by Weinland and Gro 
begins to fill the need. 

M. J. WAN 

University of Rochester. 
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A RECONSIDERATION OF THE PROBLEM OF INTROSPECTION 


DAVID BAKAN 
University of Missouri 


It is the purpose of this essay to 
raise a general question for rethinking 
in the perspective of modern times. 
Two related considerations are in- 
volved in the motivation to write and 
publish an essay on introspection as 
a method for the investigation of 
psychological phenomena. The first 
is a sense of society's need for a psy- 
chology which is more appropriate to 
its problems. The second is a convic- 
tion that although psychologists 
should be methodologically careful, 
they should not afford themselves the 
luxury of methodological snobbery. 
There is no investigatory method 
which is "pure," and which provides 


4 an absolute guarantee against the 


commission of error. If errors be 
committed, we look to the future for 
their correctives. In the meantime, 
and perhaps ultimately, we accept a 
pragmatic criterion. 

It is characteristic in the history 
of ideas that when some notion is re- 
Jected, even for adequate cause, 
Many seemingly associated notions 
get rejected with it. Often these 
mated notions may be sound. 
n. ji the theses of this essay is that 
E: as been the case with intro- 
cu ion. In the outright rejection of 

peel of introspection, much 
e) of considerable value was 

7 2 Es of the avowed rejection of 
is ethod, it has stayed with us in 
eral disguised forms. As Boring 


has recently indicated, “introspection 
is still with us, doing its business 
under various aliases, of which verbal 
report is one” (4, p. 169). Boring 
seems relatively uncritical of the 
manner in which we contemporarily 
avail ourselves of introspection. The 
argument here is for a careful and 
avowed use of introspection. 

In less disguised form introspection 
is with us in contemporary clinical 
psychology. The method of intro- 
spection is the method that the pa- 
tient uses, although there is little 
avowed recognition of it as the 
method of the clinician, except per- 
haps among the psychoanalysts (15). 
However, “therapy” is coming to be 
viewed as appropriate training for 
the aspirant clinician even in non- 
psychoanalytic contexts. 


A HyporHesis CONCERNING THE 
REJECTION OF THE INTROSPECTIVE 
METHOD 


The rejection of the method of in- 
trospection is coincident with the in- 
ception of behaviorism in America. 
The first important behavioristic 
pronunciamento took place in 1913 
(20). It is important to understand 
the immediate antecedents of be- 
haviorism in order to understand the 
wide popularity that it gained: Bor- 
ing's comprehensive history makes 
it unnecessary to recount the in- 
volved circumstances associated with 
the death of classical introspection. 
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Boring believes that it “went out of 
style... because it had demon- 
strated no functional use and there- 
fore seemed dull, and also because it 
was unreliable" (4, p. 174). In the 
next few paragraphs a hypothesis 
will be offered to supplement that of 
Boring. 

Psychology was in the throes of the 
Würzburg-Cornell struggle in the 
first]decade of the twentieth century. 
The Wiirzburgers had discovered 
imageless thoughts; and they them- 
selves hardly knew what to do with 
them. Titchener, at Cornell, sensed 
the staggering implications of the 
Würzburg findings, and struggled 
desperately to reject them (16). 

The psychological literature of the 
time is in many respects confused, 
repetitive, and—we might say— 
anguished. Psychology had, it 
seemed, got itself into absolutely in- 
extricable difficulties; and there was 
no one within the introspective move- 
ment who had the clarity of vision to 
go beyond these difficulties. Watson, 
for all the limitations that we may 
ascribe to him, had clarity and offered 
a program which psychologists could 
follow. 

Let us briefly examine the nature 
of some of the Wiirzburg findings. 
They discovered that thought was 
possible without images; and that 
thought was guided by states vari- 
ously designated by the terms Auf- 
gabe, Bewusstseinslage, and determini- 
erende Tendenz. The favored method 
was the Ausfragemethode. Mayer and 
Orth (11) used the method of free 
association to a verbal stimulus, in- 
structing the subject to report every- 
thing that went on between the hear- 
ing of the stimulus word and the 
making of the response. Messer (14) 
finds himself forced to posit un- 
conscious processes underlying the 
processes of thought. Ach (1, 9)! in- 
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troduces the concept of the w 
motivation, as guiding the L 
processes; he uses a probing in 
gatory procedure; and he uses h 
nosis. Bühler (6) indicates that it: 
important, in the study of f 
thought processes, to empathize an 
sympathize with the subjects el 
gaged in this kind of experimentatio 
Then, the problem is dropped lik 
the proverbial hot potato. Kiilp 
the leading figure in the Würz 
movement, leaves Würzburg and 
to Bonn in 1909, and the work prat 
tically ceases. Bühler posthumoush 
publishes Kiilpe’s lectures which, 
cording to Boring, “contain a 
complete system of psychology. 
the chapter on thought was m 
Bühler said that Külpe had not bee 
lecturing on the topic" (3, p. 407 
In the light of the foregoing, 
the light of what we have lear 
from psychoanalysis, a rather simpl 
explanation suggests itself. These im 
vestigators were using themselve 
and each other as subjects. They 
struck the unconscious, and pa 
larly unconscious motivation 
had to probe it if they were ton 
any headway. However, as we ki 
today, probing the unconscious 
to generate anxiety and resis 
and these investigators simply 
not prepared to undergo the n 
sary personal trials involved. Borini 
(4, p. 186) suggests a relationship bi 
tween the Würzburg school am 
Freud, but makes little of it. 
Psychology had two possible à 
natives: either to widen its inv 
gations to take account of am 
study the role of unconscious mottv 
tion on the thought processes, OF 
detour. Academic psychol 
toured; and detoured in two 
!The writer could not locate a 


Ach's book. This sentence is bi 
Humphrey's (9) summary of Ach's work. 


PROBLEM OF INTROSPECTION 


It detoured by way of behaviorism, 
completely rejecting (at least avow- 
edly) the whole method of intro- 
spection, and it detoured by way 
of gestalt psychology. The former 
dropped the whole concept of mind, 
conscious and unconscious. The 
latter adopted as a basic principle 
that whatever introspection is done 
should be naive introspection, with 
no probing and no analysis, thus 
presenting intrusion upon the uncon- 
scious. 


A Basic DISTINCTION FOR 
INTROSPECTION 


Perhaps one of the most important 
distinctions necessary for the under- 
standing of the nature of introspec- 
tion is the classical one between the 
experience and that of which the ex- 
perience is. Itis the distinction which 
is contained in the classical one of 
Kundgabe versus Beschreibung (4). 
Itis the distinction which is indicated 
by the concept of stimulus-error (2). 
Itis the distinction which the psycho- 
analyst makes when he concerns him- 
self primarily with a memory, as con- 
trasted with the event to which the 
memory presumably refers. 

The distinction is somewhat dif- 
ficult to grasp when we deal with per- 
ception, Let us consider a simple ex- 
perience reported as “I see a book." 
From the point of view of this dis- 
tinction it is one or another of two 
reports: "I see a book," or “I see a 
book." In the first instance it is a re- 
Port of experience as experience. In 
the second instance the reference is to 
the object rather than to the'experi- 
ence of the object. The first can be 
true, and the second false, as, for ex- 
ample, in an hallucination. 
ig distinction is easier to make 

en we consider something like 
one: It is hard to make when the 

perience involves an external stim- 
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ulus. It is of interest that when 
Washburn made her presidential ad- 
dress before the American Psycho- 
logical Association (19) in 1921 she 
felt that it was necessary to say that 
introspection is proper only where 
there is an external stimulus. This, 
she believed, would endow introspec- 
tion with “objectivity’—an unfor- 
tunate semantic identification of 
“object” with “objectivity.” It is 
here, probably, when the Watsonian 
noose was drawing very tight around 
the neck of introspection, that intro- 
spection surrendered the very thing 
which was its major merit. Intro- 
spection has its maximum value on 
those very experiences for which 
there may be no conspicuous physical 
stimuli, such as grief, joy, anxiety, 
anger, depression, exhilaration, etc. 


THE PROBLEM OF LANGUAGE 
AND COMMUNICATION 


A major criticism which has been 
leveled against the method of intro- 
spection is that the data of introspec- 
tion are not public. In the case of 
overt behavior it is possible, at least 
in principle, for two observers to ob- 
serve a given phenomenon simul- 
taneously. This has sometimes been 
referred to as the criterion of pub- 
licity; and it has been said that data 
are not acceptable unless this cri- 
terion has been satisfied (again, at 
least in principle). 

That introspective data are not 
public in this sense is not to be ques- 
tioned. What is to be questioned is 
whether the criterion is essential. 
What is the value of the criterion of 
publicity? Its value, presumably, 
inheres in the conviction that it 
avoids error and provides for verifica- 
tion. However, can we not have veri- 
fication without publicity? Let us 
consider one of the most acceptable 
kinds of investigatory procedure from 
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this point of view, the conditioning 
experiment. There is no way of veri- 
fying Pavlov's experiments today by 
having another observer watching 
them, since, to say the least, Pavlov's 
dogs are probably all quite dead. In 
order to verify Pavlov's findings we 
would have to get other dogs. Fur- 
thermore, the fact that two people 
could have stood by to count the 
number of drops of saliva is quite ir- 
relevant. If the criterion of publicity 
is not met by introspection, it is not 
really very serious as long as each 
scientist has, so to speak, at least one 
"dog" whom he can observe directly. 

The crisis which was generated by 
disparate results from Würzburg and 
Cornell, with the one finding image- 
less thoughts and the other not find- 

_ ing them, was hardly adequate reason 
for the total rejection of introspec- 
tion. Disparate results from different 
laboratories are usually provocative 
of further investigation, rather than 
the occasion for dropping the prob- 
lems, the methods, and the funda- 
mental points of view involved. The 
failure of the introspective method to 
satisfy this naive criterion"of pub- 
licity could hardly have been the real 
reason for the rejection of introspec- 
tion as a method. 

A more important problem is the 
possibility of publicity, not of the 
data, but of the report. Even though 
the process of introspective observa- 
tion is, in a sense, private, the infor- 
mation gleaned from the observations 
must be public. This raises the ques- 
tion of language and communication. 
There are two questions that may be 
asked in connection with language 
with respect to introspection: First, 
if we relate our introspections to one 
another, would we understand one 
another? Second, if we do understand 
one another, how does this come to 
pass? If the answer to the first ques- 
tion is to any degree affirmative, then 
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to that extent is the criterion of pul 
licity of report satisfied. < 

For the answer to the first questio 
we appeal, at the very least, to com 
mon sense. If we hear a person si 
“I am sorry," or “I am worried,” 6 
“T feel sick," etc., there is hardly an 
question but that we understan 
what he means. There are : 
when we may not believe him; but t 
possibility of fraud, intentional or ur 
intentional, or of lack of precisio 
exists with respect to any methodo 
logy. The fact is, however, that? 
understand him. 

The answer to the second ques 
now becomes a matter for empi 
investigation. This is not the place 
enter into a detailed discussion of th 
psychology of language learninj 
However, it is extremely pertinent | 
indicate that the theory of languag 
learning implicit in contemporary | 
havioristics is much more simple 
is consistent with the facts. This i 
plicit theory may be roughly cha 
terized as follows: 

The teacher holds up a ball 
says, “Ball.” The learner rep 
"Ball" The learner then, presum 
bly, comes to “know” the meaning 
the word. Certainly the theory 
stretched to the breaking point wh 
presented with the fact that we 
fairly well understand the meanin 
of words such as “sorrow,” "'feeli 
“nausea,” "if," “but,” etc. 


essay En in part, attemp 
present to English-speaking deti 
some of the contributions of Fra 
Brentano, In the judgment of | 
writer, Brentano is one of the mo 
important figures in the history 
psychology. The major work 
Brentano with respect to psycho 
(5), has not, as far as could be det 
mined by the writer, been translat 
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into English. Of Brentano and 
Wundt, Titchener wrote: “The stu- 
dent of psychology, though his per- 
sonal indebtedness be also twofold, 
must still make his choice for one or 
the other. There is no middle way 
between Brentano and Wundt” (18, 
p. 108). For the most part, the choice 
of the classical introspectionists was 
for Wundt. Wundt and Brentano 
published their major psychological 
works at about the same time. Two 
major schools of thought issue from 
Brentano. One is the already- 
mentioned Wiirzburg school. The 
other is psychoanalysis, with Bren- 
tano having been the only academic 
psychologist under whom Freud stud- 
ied (12, 13). Psychoanalysis, how- 
ever, differed from the Wiirzburgers 
with respect to a readiness to face the 
unconscious. It may have been easier 
for Freud to break through to the 
unconscious because it was not his 
own unconscious but the unconscious 
of his patients. It was only second- 
arily that Freud used himself as sub- 
ject. The Wiirzburgers, on the other 
hand, used themselves and each other 
as subjects. 

Brentano, Kiilpe, and Freud con- 
ceived of introspection not as of the 
Present, but the past. They took 
Seriously what was then a common 
observation that introspection at the 
Moment an experience is taking place 
changes the character of that experi- 
ence, If we are interested, say, in 
anger, then introspection at the mo- 
Ment of anger tends to reduce the 
net It is only when anger is past 
Usii it can be properly examined. 
cid the method of introspection, 
it BR retrospectively, makes 

Sible to examine psychological 
“Seepage which cannot readily be 

EM in the laboratory, except per- 

Ce very great ingenuity. 
of ". difficulty of the introspection 

undt and Titchener was ade- 
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quately recognized by McDougall. 
He wrote: “Experimental introspec- 
tion has obvious limitations. Many of 
our most vital and interesting experi- 
ences, such as grief or joy or fear or 
moral struggle, cannot be induced at 
will, except perhaps, in very slight 
degrees. And, under the most favour- 
able conditions, introspection of our 
more vivid and vital experiences is 
difficult, because we are apt to be 
primarily interested in the events of 
the outer world in which we are 
taking part, if only as observers. 
Then again the very act of introspec- 
tion does to some extent modify the 
experiences we wish to observe and 
describe; so that in introspecting we 
partially defeat our own purposes" 
(10, p. 4). 

Thus, the type of introspection 
which was advocated by Titchener, 
and which was the object of attack 
by the anti-introspectionists, was à 
type which, by its nature, could not 
attack the important aspects and 
kinds of experience. The cry that a 
psychology was wanted which would 
have some usefulness was completely 
justified when the object of attack 
was the kind of introspection advo- 
cated by Titchener. 


Errors OF INTROSPECTION 


A characteristic of good science is 
that it is ever alert to the possibility 
of the commission of systematic types 
of errors. One of the major criticisms 
which has been leveled against in- 
trospection is that its results are un- 
trustworthy. In the following few 
paragraphs a brief attempt will be 
made to examine the problem of the 
trustworthiness or validity of intro- 
spective reports. 

There is a respect in which intro- 
spective observations are more trust- 
worthy than observations made by 
the use of the sense organs. Sense 
organs may be defective. Sense or- 
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gans are subject to illusion. Observa- 
tions made with the sense organs are 
subject to the accidents of angle of 
regard, kind of illumination, noise 
level, etc, In the last analysis, the 
sense organs are subject to hallucina- 
tion. Introspection is a method 
which does not involve the sense 
organs in the usual fashion, and 
therefore all of the error tendencies 
associated with the sense organs do 
not exist for introspection. 

However, introspection has asso- 
ciated with it other sources of error. 
But even at this date, we have 
achieved a certain amount of prog- 
ress in isolating them. We know 
about the stimulus-error. We are 
aware of the tendency to suppress 
data (repression), of the tendency to 
supply socially acceptable data in 
place of other data (distortion, ra- 
tionalization, displacement, etc.). 
But, insofar as we are aware of these 
error tendencies we can take precau- 
tions against their commission. In 
this respect introspection is no differ- 
ent from any other set of methods in 
science. To be aware, for example, of 
the tendency toward rationalization 
stimulates us to challenge our intro- 
spective findings to determine 
whether they have resulted from the 
rationalization process. It isa matter 
of time and careful work to discover 
other error sources. We have dis- 
covered suggestion, cultural deter- 
mination, ethnocentrism, etc.; and 
the list will probably lengthen as our 
experience with the method is en- 
larged. 


A FUNDAMENTAL DIFFERENCE BE- 
TWEEN CLASSICAL INTROSPECTION 
AND PSYCHOANALYSIS 


Psychoanalysis has one major limi- 
tation with respect to our purposes 
which was not present in classical 


DAVID BAKAN 


introspection. This is that the mi 
objective of psychoanalysis is th 
apy." The major objective à 
classical introspectionists was @ 
acquisition of knowledge. This i 
fundamental difference. ! 

Essentially, what is being adw 
cated in this paper is the use of W 
psychoanalytic method with the g 
jective of the classical intros : 
ists. 

It has been indicated that v 
being advocated in this paper 


a science of psychology which i 
practical implications. 
there is an old lesson in the hi 
science of which we avail o 
Whereas knowledge may have 
ticality as its ultimate objec 
has been found that we sometimes 
better, both practically and 
tically, if we temporarily forsa 
practical objective. 
In taking the objective from 
classical introspectionists, it is n 
sary to make some modificatio 
the psychoanalytic procedure. A 
though the investigator should b 
"free" in his associations, he sli 
not permit himself to wander too 
from the subject under investiga 
His associations should stay U 
the influence of the task at hand. € 
course, as in any investigation, dec 
sions of relevance have to be m 
and sometimes only a dim intuit 
dictates the nature of these € 
sions. Although there is no a p 
method for determining releva 
the investigator should always | 
tempt to keep in mind that h 
serving science primarily, and | 
self secondarily. s 
* This is true even though Freud d 
visage that "the future will probal 
tribute far greater importance to psy 
analysis as the science of the unconscious 
as a therapeutic procedure” (7, p. 673). 
(UN 
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A “MINIATURE” INVESTIGATION OF 

THE RETENTION AND REVELATION OF 

Secrets BY THE METHOD OF RETRO- 
SPECTIVE ANALYSIS 


In accordance with what has al- 
ready been said the writer attempted 
to conduct an investigation of the 
kind suggested. It is a “miniature” 
investigation in that it was conducted 
only over a very short period of time. 
It was conducted for five days for 
about an hour and a half each day. 

There were several reasons for the 
choice of the topic, retention and rev- 
elation of secrets. One of these is 
that the topic seemed to be one which 
is more amenable to introspective 
investigation than to other methods. 
By its very nature a secret is some- 
thing which may not reflect itself in 
overt behavior. The latter almost 
constitutes a definition of a secret. 
Another reason for the choice of the 
topic is that it seems to be a funda- 
mental one for any kind of introspec- 
tive investigation. It seemed impor- 
tant to obtain information concerning 
the nature of secret retention and 
Secret revelation before very much 
Progress could be made with other 
topics. A third reason was that the 
topic seemed to lie close-to the oft- 
Stated objective of psychology as 

Ng prediction and control of hu- 
man behavior. 

4 the procedure simply involved 

tting down to the typewriter and 
typing whatever came, after the de- 
Cision concerning the topic was made. 
E choice of the typewriter was 

ade primarily on the basis that the 
M found himself to be more 

en TH Me than either writing by 
Machine ing into a recording 
je rirtue of the nature of the sub- 

, Chosen, the writer attempted to 
Write "a h ” . 

s though" the material would 
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never bereleased. Under any circum- 
stances, even if this was a myth, the 
sense of the possibility of editing was 
not mythical. At the moment the 
writer does not consider it wise to re- 
lease the protocol. However, one 
example will be given. The following 
is taken from the record with some 
editing: 
. . . What is one of the secrets such as thee and 
me have? once talked to a professor of 
zoology at lunch about the academic life. He 
commented that over the head of every 
academician hangs a sword on a thin string. 
No matter how much you do, you never feel 
that you are doing enough. I am reminded of 
Freud's dream of Irma's injection. He says, 
“I am always careful, of course, to see that the 
syringe is perfectly clean. For I am conscien- 
tious.” The italics are mine. If he felt that 
he were really conscientious, if he had no feel- 
is connection, why 


conscientiousness is my "secret." 
find myself confessing to lack of conscientious- 
ness. But I was not able to do so until I was 
able to remember something which would 
make it possible for me not to have my guilt 
alone. I brought up the zoology professor. 
When I wrote the above line about him I 
hesitated for a moment on the question of 
whether or not to use quotation marks, or to 
write it in the way that I did. The quotation 
marks would have had to come, in all honesty, 
after the word “string.” I wrote on, however, 
“No matter how much you do, you never feel 
that you are doing enough." This is what I 
would have liked him to have said. I added it 
to give the impression that he had said it, 
but not quite lying about it. 

I think that what has been said above can 
be generalized. We are more prone to confess 
a secret guilt when we can believe that others 
have the same secret guilt... 


The general pattern involved in 
this kind of writing is that of an oscil- 
lation between a free expressive mood 
and an analytic mood, with the free 
expression being the subject of the 
analysis. The question of what a 
given item of free expression might 
mean with respect to the major topic 
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under investigation was repeatedly 
asked. 

In the course of this investigation 
a series of propositions, including the 
italicized one above, were formulated. 
This list can be considered to be the 
yield of this "miniature" investiga- 
tion: 

1. (Given above.) 

2. Persons with a secret guilt tend 
to create situations in which they can 
"see" that others have the same 
secret guilt. 

3. A secret is a secret by virtue of 
the anticipation of negative reactions 
from other people. 

4. A secret is maintained in order 
to maintain some given perception of 
one's self in others. 

5. Persons who associate with one 
another in the context of a larger 
group, who have a secret from that 
larger group, will create a metaphori- 
cal or otherwise cryptographic lan- 
guage in which to discuss the secret. 

6. In order to conceal a secret one 
may tend to reveal a fabricated 
"secret," or a less-secret secret, in 
order to generate the impression that 
one is being open and frank. 

7. One of the important secret 
areas in our culture is in connection 
with our intellectual limitations. 

8. When an individual has a secret 
he will attempt to “protest” that the 
opposite is the case, if the secret has 
an opposite. 

9. The revelation of a secret may 
involve the attempt to generate the 
impression that one is telling a joke, 
to achieve the double purpose of rev- 
elation on the one hand and dis- 
belief on the other. 

10. In the revelation of a secret 
one may attempt to generate the 
impression that one degrades one's 
self in one's own eyes, in order to re- 
duce the degradation that one antic- 
ipates will be the reaction of others 
to the revelation. 
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11. If A knows a secret abou 
and B knows a secret about A, 
if A discovers that B has revealed 
secret, then A will be inclined to 
veal B's secret. 

12. If an individual changes 
group identification from Group 
to Group B, and if Group A has: 
secret which it keeps from Group] 
that individual will be inclined to t 
veal Group A's secret to the memb 
of Group B. 


DISCUSSION OF THE '"MINIATUR 
INVESTIGATION 


The simple fecundity of the meth 
soon became evident. After the de 
sion was made to attempt it and 
brief beginning was made, it beca 
apparent that this was, iv use a te 
from the vernacular, a veritable mi 
of information. Essentially it cap 
talizes on the fact that the inve 
tor has had twenty or thirty or for 
or fifty or sixty or seventy years fto 
the collection of various kinds of il 
formation. Certainly one of the 
fects of this kind of data collection! 
that it is not systematic in the usui 
way in which we understand th 
term. Yet it is the result of years 0 
trial and error, of a kind which mo 
laboratory types of investigation ¢ 
not generally get. It may be argu 
that these data have been unerit 
cally gathered. This is a valid 
However, the necessary critical 
can be supplied in the course of tl 
investigation itself. f 

This kind of investigation can ' 
severely hampered by what may 9! 
loosely designated as "ethical" CO 
siderations. Let us consider, for € 
ample, the third proposition enume 
ated above, that a secret is mal 
tained in order to maintain a giv’ 
perception of one's self in others. 4 
virtue of the intimate connection D 
tween ethics, in this larger sense, a" 
the kind of data which may be 
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the subject of an introspective in- 
vestigation, it is extremely important 
that the investigator attempt, to the 
degree that he can, to divest the in- 
vestigation of ethical considerations. 
Methodologically this divesture may 
involve a preliminary investigation 
of the ethical considerations them- 
selves. Also, it must be added that, 
for some kinds of problems to be in- 
vestigated by these methods, less 
may be required in the way of pre- 
liminary investigation than for other 
problems. However, for the investi- 
gation of any problem by these 
methods, a scientific and objective 
attitude is a prerequisite. 

One of the major merits of this kind 
of an approach is that it studies the 
phenomena of psychology directly, in 
a manner which is rarely the case in 
most psychological investigations. 
Actually, the kind of material which 
issues from an introspective investi- 
gation of the kind being advocated is 
presupposed in many other psycho- 
logical investigations. Consider for 
the moment the “lie” scale of the 
Minnesota Multiphasic Personality 
Inventory. The test presumably 

gets at" the kind of thing which has 
been investigated in the investigation 
on secrets cited above. However, the 
items of this scale were selected be- 
Cause they would presumably be an- 
Swered negatively by persons who 
were trying to put themselves “in the 
most acceptable light socially" (8). 
This presumes, with little qualifica- 
tion, the content of the fourth propo- 
sition, as well as about 15 preconcep- 
tions concerning the meaning of social 
acceptability. (There are 15 items in 
the “lie” scale.) 
Mi arthermore, had the makers of the 
AUI critically examined the na- 
ab i secrets in the way in which it 
io een begun in the above investiga- 

n, they would have seen that there 
are other dynamics of lying, in addi- 
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tion to the one of which they did avail 
themselves. For example, proposi- 
tion 6 indicates that a certain amount 
of truth-telling may simply be a de- 
vice for "covering up" one or more 
other lies. It may well be that the 
operation of the dynamic indicated 
by proposition 6 acts to depress the 
“lie” score when lying is really taking 
place. A full awareness of the kind of 
thing that issues from such an investi- 
gation can greatly enhance the effec- 
tiveness even of pencil-and-paper 
tests. 

From a more theoretical point of 
view, if we seriously accept the mis- 
sion of psychology as being that of 
the prediction and control of human 
behavior, the psychology of secrets is 
an important link in the chain of 
psychological findings and theory. 
Investigators, no matter what they 
are investigating, must be cognizant, 
at the very least, of the possibility of 
dissemblance when they use human 
subjects. To predict and control 
an individual's behavior it is impor- 
tant to know, for example, his group- 
identifications, his objectives, his 
values, etc. Many of these items of 
information are secret. They may 
even be secret to the subject himself. 
And under any circumstances they 
are not items of information which 
will be revealed readily. Thus, until 
psychologists develop a rather full 
understanding of the dynamics of 
this phenomenon, the ignorance of 
this phenomenon will stand in the 
way of other investigations. 

What has been said in the above 
paragraph would be considerably less 
cogent if the phenomenon of the secret 
played only a small role in connection 
with other phenomena. However, 
secrets play their most important 
role in those phenomena which are 
most vital. A psychology that seeks 
to understand these vital phenomena 
must have an appreciation of the 
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phenomena of secret retention and 
secret revelation. "Whether we are 
interested in the problems of mar- 
riage, industrial management, leader- 
ship, prejudice, loyalty, delinquency, 
international affairs, politics, military 
strategy, litigation, business prac- 
tices, economics, etc., the psychology 
of secrets is extremely pertinent. 
And, the psychology of secrets yields 
most effectively to the method which 
is being proposed. 

It might be indicated that an ade- 
quate psychology of secrets would 
represent an extremely important 
contribution to our country in its 
present state. An adequate psy- 
chology of secrets would provide us 
with an insuperable advantage over 
our enemies and potential enemies. 
Concretely, for example, we could use 
the information about the psychology 
of secrets for enhancing the skill of 
the military interrogator in extract- 
ing information from prisoners of 
war; and we could teach our own 
soldiers about possible techniques 
that may be used against them in the 
attempt to extract information from 
them, so that they may be prepared 
for them. Also, an adequate psychol- 
ogy of secrets might make quite un- 
necessary some contemporary politi- 
cal investigatory practices which 
enjoy some popularity only in lieu of 
more scientific devices, 

Although the psychology of secrets 
is perhaps a central and basic one as- 
sociated with the method, investiga- 
tions could and should be pursued 
with great profit on other problems. 
Thus, for example, problem solving 
and decision making can and should 
be investigated by the method of ret- 
rospective analysis. Investigations 
on status, power, anxiety, fear, ag- 
gression, aesthetic experience, learn- 
ing, communication, memory, con- 
cept formation, perception, judgment, 
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charity, loneliness, betrayal, etc. 
could and should be carried out to 
enhance our understanding of these 
phenomena. 


THE PROBLEM OF VALIDITY OF THE 
FINDINGS FROM THE "MINIATURE" 
INVESTIGATION 


Perhaps the critical question in the 
mind of the reader up to this point is 
that of the “validity” of the findings 
of an introspective investigation. 
The problem of validity has already 
been discussed previously, but some- 
what abstractly. In this section, the 
problem will be dealt with somewhat 
more concretely, with the findings of 
the “miniature” investigation before 
us. 

The propositions which issued from 
the “miniature” investigation are, at 
least, what may be considered to be 
"hypotheses" for investigation by 
other methods. Thus, at the very 
least, the method may be recom- 
mended as a device for systematically 
getting hypotheses as contrasted 
with, say, the casual reaching out for 
a pair of variables and hypothesizing 
a relationship between them. j 

Again, as has already been indi- 
cated, it may be used as a meth 
whereby an investigator can bring 
his presuppositions concerning an 
investigation to formulation; where 
he can critically examine his presup- 
positions; and where he might be 
helped in conceiving of other presup- 
positions against which he can con- 
trast the ones he is using. Or, the 
method could be used as a device 
whereby an investigator, having go 
some experimental results which he 
cannot understand, provokes his 
imagination to arrive at some kin of 
an explanation of his results. The 
deliberate and avowed adoption of the 
method would be extremely helpfu 
in these respects. 


PROBLEM OF INTROSPECTION 


However, the writer believes that 
the method warrants more than this. 
As has been indicated, the method has 
a direciness which is not to be found 
in any other method of investigation 
of psychological phenomena. In any 
investigation each thing which lies 
between the phenomenon and the 
data is a source of error. These 
sources of error are minimized by the 
method which is being proposed. All 
errors such as failure of the subject to 
cooperate (e.g., rehearsal when in- 
structed not to do so in studies on 
reminiscence), dissemblance, failure 
to comprehend instructions, refusal 
to believe the expressions of the in- 
vestigator's avowed intentions, fear 
of  hidden—or manifest—micro- 
phones, lack of skill on the part of the 
subject (e.g., fixating a point in a 
vision experiment), refusal to take a 
"naive" attitude (in gestalt experi- 
ments), the lack of control over hu- 
man subjects (e.g., subjects in prob- 
lem-solving studies already knowing 
the solutions to problems but not in- 
forming the investigator), subjects 
knowing the intention of the investi- 
gator (e.g., subjects knowing that the 
experimenter is interested in demon- 
Strating a relation between frustration 
and aggression, and therefore conceal- 
ing their felt aggression), etc. are 
minimized in this kind of investiga- 
tion, 

.The propositions which were 
yielded by the “miniature” investiga- 
tion also have a certain kind of self- 
evidence associated with them. They 
elicit the “of course" response. Some 
of the propositions may require 
further specification and further qual- 
ification, Nevertheless, they are in 
Some sense obvious. It is the sense of 
Sel-evidence which is associated, 
Perhaps, with the axioms of Euclid- 
ean geometry. The nature of self- 
evidence is, of course, an extremely 
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difficult problem and perhaps more 
properly falls in the province of the 
philosopher. Or, perhaps, self-evi- 
dence is a problem to be investigated 
by the very methods which are here 
proposed. However, whatever the 
ultimate nature of self-evidence may 
be, there is a sense in which the re- 
sults of an introspective investigation 
are of this type. 

Now, of course, the matter of self- 
evidence may be challenged by the 
question: Self-evident to whom? In 
one respect this as a valid question. 
But in another respect it is not. It is 
valid in that if we are to know that 
it is self-evident it must be self- 
evident to someone. However, when 
the mathematician uses the term 
self-evident he means something which 
is intrinsic to the proposition, rather 
than something dependent upon the 
reader or the hearer of the proposi- 
tion. For the mathematician it is the 
self-evidence of the proposition which 
makes it possible for the person to see 
the self-evidence, rather than the 
reverse. It is this characteristic which 
is shared by introspective proposi- 
tions. 

As a matter of fact, some of the 
propositions which issued from the 
“miniature” investigation seem to 
partake of greater self-evidence than 
others. Thus, for example, proposition 
4 seems to be quite self-evident, 
whereas proposition 5 seems to be 
somewhat less self-evident. And even 
the seeming self-evidence of proposi- 
tion 4 may be quite culture bound. 
However, what has been reported is 
only an extremely limited investiga- 
tion, only a beginning and only a 
sample. Nevertheless, what has been 
presented is enough to suggest the 
possibility of achieving the kind of 
self-evidence that has been indicated. 

Two related, but distinguishable, 
problems are those of replication and 
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generality. Can such an investigation 
be replicated? The answer is affirma- 
tive, although the difficulties of repli- 
cation should be recognized and ac- 
count should be taken of them. If an 
investigator attempts to replicate his 
own investigation at another time, he 
will inevitably be under the influence 
of what he hasalready done. In repli- 
cating such an investigation, the very 
replication itself should come under 
the scrutiny of the investigator. He 
should challenge, for example, his 
personal identification with the re- 
sults he has already obtained, and 
prepare himself for finding both 
novelty and contradiction with re- 
spect to his earlier investigation. If 
one investigator is interested in repli- 
cating the investigation of another 
investigator, he should carefully take 
into account the possibility of sug- 
gestion, or of his willingness to accept 
the results of the earlier investigator 
(particularly if the first investigator 
has prestige for the second investi- 
gator) He should take careful cog- 
nizance of possible motivation for 
showing the earlier investigator to be 
in error, etc. In some instances it 
may be extremely worth while to in- 
vestigate some topic without reading 
the results of the earlier investigation 
until the completion of the second in- 
vestigation, and the making of a com- 
parison later on. Carefully controlled 
experimentation to determine possi- 
ble effects of suggestion, for example, 
is extremely feasible. 

The generality of the results of 
such an investigation is a somewhat 
more difficult problem, but it is a dif- 
ficulty which is not unique to intro- 
spective investigation. One investi- 
gator's results can be compared with 
another investigator's results, so that 
the problem of uniqueness with re- 
spect to a single investigator is viti- 
ated. However, one may ask, in the 
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event of consistency of results among 
a group of investigators, may the 
findings not be unique to a group of 
persons all of whom are introspective 
investigators? There is no easy an- 
swer to this problem. However, we 
face the same problem in other in- 
vestigations. May not the results of 
studies in rote learning be largely 
unique to college sophomores? May 
not the results of studies in, say, 
secondary reinforcement be unique to 
rats, or more particularly laboratory 
rats, or even more particularly white 
laboratory rats, or still more particu- 
larly tamed white laboratory rats, 
etc.? May not all findings concerning 
mental abnormality be unique to 
mentally abnormal persons contacted 
by investigators, and may not these 
very contacts be a major determinant 
of the findings? 

The answer, of course, to each of 
these questions is contingent upon 
some decision concerning relevance, 
a decision that has to be made in 
connection with any investigation. 
Actually, the kind of investigation 
being advocated has an advantage in 
this respect over other kinds of in- 
vestigation. For, in an introspective 
investigation the very decisions con- 
cerning relevance can come under the 
same scrutiny as the phenomena be- 
ing investigated. 


PossIBILITY AS A FINDING 


Theargument concerning the valid- 
ity of the findings from an intro- 
spective investigation thus far has 
been concerned with validity in the 
usual sense, i.e., the argument has 
been concerned with the truth or 
falsity of propositions which issue 
from an introspective investigation. 

There is, however, a value to such 
propositions which is over and be- 
yond their validity. This is their 
possibility rather than their truth or 
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falsity. The knowledge that a certain 
dynamic is possible enhances the 
sensitivity of the psychological ob- 
server. To make this point concrete, 
let us again consider the military 
interrogation situation. Suppose that 
the interrogator is interested in deter- 
mining the contents of some supplies 
which have been moved in by the 
enemy. Now suppose that the prison- 
er being interrogated knows these 
contents but does not wish to reveal 
them. The prisoner may avail him- 
self of the dynamic indicated by 
proposition 6 (the revelation of less 
secret secrets in order to generate the 
impression that he is being open and 
frank), and inform the interrogator 
at length about a great number of 
lesser secrets, but not the nature of 
the supplies. He may say, "I will 
tell you everything that I know, but 
I do not know what was in those 
trucks.” An interrogator who was 
not aware of the possibility of proposi- 
tion 6 might be lulled into believing 
the man. The interrogator might 
say to himself, “he is evidently tell- 
ing all that he knows." On the other 
hand, an interrogator who is aware 
of the possibility of the action of the 
dynamic indicated by proposition 
6 would be aware of the possibility 
of this kind of deception, and would 
be less likely to be “taken in." 
Insofar as at least one person can 
contrive such a device for deception, 
then such a device is possible, and 
some other individual may have con- 
Ceived of it and may be making use of 
i. The truth of the proposition in 
this respect becomes quite secondary. 
What is important, simply, is that 
Someone thought of it; and if one 
Person thought of it, other persons 
might think of it. 
Dm this respect psychologists can 
5 € a major contribution to society 
ot only by rendering to society 
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established truths, but also by render- 
ing to society established possibles 
with respect to psychological dy- 
namics. In the matter of prediction 
and control of human behavior, a 
knowledge of what an individual 
might possibly do, or possibly feel, 
or possibly think places us well on 
the way toward the achievement of 
our objective. Given a detailed 
knowledge concerning the possibles 
we can act in such a fashion as to 
discourage some from becoming 
actualities, and to encourage others 
into becoming actualities. The prag- 
matic usefulness of knowledge of 
possibles extends from the clinical 
situation to world affairs. 

As has been suggested, these pos- 
sibles may indeed turn out to be 
truths in the larger and more scien- 
tific sense. But even if they fail to 
meet the criteria for general scientific 
propositions, they have value in the 
sense indicated above. 


SUMMARY 


It is the purpose of this essay to 
raise the question of the appropriate- 
ness of the method of introspection 
for rethinking in the perspective of 
modern times. Some features of the 
history of introspection in the first 
decade of the twentieth century have 
been pointed to. The hypothesis is 
advanced that introspection was 
dropped because the classical intro- 
spectionists had come to a point 
where they would have had to probe 
the unconscious to make any prog- 
ress. From psychoanalysis we have 
learned that probing the unconscious 
generates anxiety and resistance. 
The old distinction between an ex- 
perience, and that of which the ex- 
perience is, is again made. The prob- 
lems of communication of introspec- 
tive observations are discussed. It 
is claimed that introspection should 
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lem of the validity of the findings is 
discussed. The problems of replica- 
tion and generality are discussed. 
The propositions which issue from 
such an investigation can be viewed 
as having three values: first, as hy- 
potheses to be investigated by other 
methods, and otherwise to supple- 
ment other methods; second, as 
propositions which have a certain 
kind of self-evidence associated with 
them; third, as possibles which can 
give us real assistance in achieving 
the objectives of prediction and con- 
trol of human behavior. 


be retrospective, consistent with the 
position of Brentano, Külpe, and 
Freud. The problem of errors in 
introspection is discussed. The posi- 
tion is advanced that retrospective 
analysis should take its objective from 
the classical introspectionists and its 
method from the psychoanalysts, with 
some modification. A "miniature" 
investigation on the psychology of 
secret retention and secret revelation 
is described. The implications of the 
investigation both as an investigation 
in its own right and as an example of 
the method are discussed. The prob- 
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The use of check lists, charts, and 
rating scales for the objective record-. 
ing and later evaluation of change in 
the behavior and symptoms of psy- 
chiatric patients is not new. Devices 
such as the Phipps Psychiatric Clinic 
Behavior Chart (4) have been used 
for a half century on psychiatric 
wards to record patient change. 
Plant (17) reported a rating scheme 
for describing patient behavior on 
the ward in 1922. In 1933, Moore 
(14) published his chart and “Schema 
for the Quantitative Measurement 
of Abnormal Emotional Conditions" 
containing some 36 carefully con- 
structed scales. The interest in men- 
tal health problems intensified by 
events in World War II resulted in 
a marked upswing in the develop- 
ment of procedures for the objective 
measurement of psychopathology and 
personality change. It is the purpose 
of this review to examine briefly 
those rating scales and check lists 
designed to describe psychiatric pa- 
tients on the ward or in the interview, 


which have appeared during the past 
ten years. 


SCALES ror USE By PSYCHIATRIC 
AIDES AND NURSES 


Six scales suitable for use in mental 
hospitals by nurses and psychiatric 
aides have been reported. These are 
generally designed to secure (a) 
quantified descriptive reports of read- 
ily observable patient ward behavior, 
a o quantitative estimates of 
oe adjustment. The first of 
(2a). the Gardner Behavior Chart 

^ a rating scale developed by 

Ucox out of work with psychotic 


patients, is designed to secure re- 
ports of easily observed patients' 
ward behavior from nurses and at- 
tendants. The 15 categories or scales 
used are: attention to personal ap- 
pearance, sleep, appetite, sociability, 
activity control, noise disturbance 
control, temper control, combative- 
ness control, care of property, self 
entertainment, cooperation in rou- 
tine, work capacity, work initiative 
when alone, work initiative when 
closely supervised, and willingness to 
follow directions. Under each cate- 
gory five brief phrases characterize 
the grades of behavior in the scale. 
The rating grades are none, poor, 
fair, good, and extra good; they are 
weighted from 0 to 4. The total 
score consists of the sum of the 15 
ratings received. The Behavior 
Chart has been found useful in the 
evaluation of change following pre- 
frontal lobotomy (21). 

The Fergus Falls Behavior Rating 
Sheet, prepared by Lucero and Meyer 
(10), was developed to record be- 
havior of patients who are mute, un- 
intelligible, hyperactive, or seclusive. 
Eleven aspects of behavior, such as 
work, response to meals, and response 
to patients, are rated by checking 
one of five descriptions. A value of 
1 is given to the most deviant be- 
havior and a 5 to presumed normal 
behavior.  Thirty-four raters, on 
rating 51 patients, agreed 90 per cent 
of the time even though some of the 
language used in the description ap- 

to be sufficiently ambiguous 
and difficult as to require exte 
training. A similar but briefer de- 
vice, the Norwich Rating Scales, for 
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recording patient behavior on dis- 
turbed wards, has been developed by 
Cohen, Malmo, and Thale (2). The 
ward nurse or attendant checks one 
of five descriptions of activity, ag- 
gressiveness, destructiveness, resis- 
tiveness, talkativeness, and tidiness. 
To check the reliability of the indi- 
vidual scales, 10 patients were rated 
independently by two raters. The 
average interjudge coefficient re- 
ported was .76. 

Rowell (18) has reported a graphic 
rating scale of 20 items for use by 
psychiatrically trained nurses. Indi- 
vidual scales are 5-point continua 
extending from normality on one 
end to pathology on the other. 
Variables such as preoccupation, hal- 
lucinations, delusions, affect, mood, 
blocking, and “‘flight of ideas," are 
rated. The reported immediate test- 
retest reliability for 71 total scores is 
.95, while the correlation between in- 
dependent ratings by nurses for 62 
pairs is .85. The descriptive state- 
ments preceding the scale cues ap- 
pear far too terse and lacking in 
definition for terms such as” blocking 
and flight of ideas that are ill-defined 
and notoriously difficult to judge. 

The Hospital Adjustment Scale, 
an ingenious, carefully constructed 
device for evaluating patient’s be- 
havior, has been developed by Fergu- 
son, McReynolds, and Ballachey 
(11). The scale consists of 91 state- 
ments descriptive of psychiatric pa- 
tients, such as “the patient ignores 
the activities around, him,” or “the 
patient's talk is mostly not sensible.” 
Each statement is marked as True, 
Not True, or Does Not Apply, for a 
given patient, and is keyed in such a 
manner that it is possible to obtain 
a total score indicative of the pa- 
tient’s general level of hospital ad- 
justment. The scale can be filled 
out in about 10 minutes by the psy- 
chiatric aide or nurse most familiar 
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with the day-to-day behavior of the 
patient over a period of two weeks to 
three months. In addition to the 
total adjustment scores, the scale 
also offers measures descriptive of (a) 
communication and interpersonal re- 
lations, (b) care of self and social 
responsibility, and (c) work, recrea- 
tion, and other activities. The state- 
ments can also be grouped as to 
whether they are indicative of an 
"expanding" personality, a ‘‘con- 
tracting" personality, or whether 
they are neutral. The Hospital Ad- 
justment Scale was developed from a 
pool of statements descriptive of pa- 
tients, secured from psychiatric aides. 
Statements were selected for the final 
form on the basis of measures of inter- 
judge reliability, ratings made by 16 
judges on a scale of over-all hospital 
adjustment, checks of discriminative 
power, and percentage of True, Not 
True, and Don't Know checks. 
Norms based on the records of 518 
patients from four hospitals and 
clinics are available in percentile 
form. Patients approaching release 
from hospital can be differentiated 
significantly from those judged to 
be extremely disturbed or chronic 
hospital residents. 

Scherer (20) has prepared a set of 
44 four-point scales for the evaluation 
of patient behavior in such activities 
as occupational therapy, manual 
arts, corrective therapy, educational 
therapy, recreation, and library visits. 
The Activity Rating Scales were con- 
structed on the basis of a longer list 
of patient behaviors and tested out 
on rehabilitation staff personnel. In- 
dependent ratings of the same 
group of patients indicated that, of 
1,188 pairs of independent ratings, 60 
per cent agree completely, 28 per 
cent differ by one scale interval, 3 
per cent by two scale intervals, an 
8 per cent of the scales are not rated. 
Inasmuch as the activities observed 
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in rehabilitative situations are simi- 
lar to noninstitutional, vocational, 
and social activities, the author 
postulates that the behavior exhibited 
may also be similar to and predictive 
of behavior shown in posthospital 
adjustment. 


ScALES FOR UsE BY TRAINED 
CLINICIANS 


The scales and check lists to be 
described in this section were de- 
veloped primarily for use by trained 
clinicians. Most of them represent 
efforts to secure records of currently 
observable behavior, symptoms, com- 
plaints, or inferable needs and atti- 
tudes. A few, however, are intended 
to secure evaluations of social history 
obtained from the patient or the pa- 
tient's friends and relatives. 

The Elgin Prognostic Scale, con- 
structed and validated by Wittman 
and Sternberg (28, 29), is a rating 
schedule designed to predict recovery 
in schizophrenics. It consists of 20 
rating scales weighted according to 
Prognostic importance; favorable fac- 
tors are arbitrarily assigned negative 
Weights and unfavorable factors are 
assigned positive weights. The prog- 
nostic score is the algebraic sum of 
the weighted measures. Most of the 
Variables, such as shut-in personal- 
ity, type of onset, or range of in- 
terests, are based upon premorbid 
ÉL history secured from the pa- 
oam relatives or from the patient 
ore f. E few are based on cur- 
dia discernible symptomatology 
m as hebephrenic symptoms, ideas 

influence, bizarre delusions, or 


a x 
E These scales were con- 
i cted following a review of availa- 


“Ses on prognostic factors in 
Abd phrenia. They were validated 
D Cross-validated on Elgin State 
E patients, and shown to pre- 
wi come of shock treatment with 
T accuracy than staff judg- 


121 


ments. A multiple-factor analysis (6) 
of intercorrelations between the 
scales based on a group of 200 pa- 
tients revealed three interpretable fac- 
tors: schizoid withdrawal, schizo- 
phrenic reality distortion, and per- 
sonality rigidity or inadaptability. 
The principal shortcoming of this de- 
vice is the difficulty of securing relia- 
ble premorbid personality pictures 
from parents and relatives. 

Saslow and his associates (19), in 
their effort to study the personality 
correlates of psychosomatic disorders, 
have prepared some 12 scales for 
measuring habitual patterns of reac- 
tions to crises. Each pattern is 
briefly delineated by means of a 5- 
point scale. Ratings are based on 
social history data secured from pa- 
tients during psychiatric interview. 
Included are scales of impulsiveness, 
subnormal assertiveness, obsessive- 
compulsive behavior, depressive be- 
havior, anxiety, hysteria, inward ex- 
pression of emotions, low awareness 
of body symptoms, insecure feelings 
of inferiority, repressed hostility, and 
strong dependent needs. No relia- 
bility data are reported for the indi- 
vidual scales. 

A check list resembling the Phipps 
Clinic Chart, but more systematic in 
its approach, has been developed by 
Peters (16). The check list, which 
consists of 199 traits, is grouped 
under 7 categories: history, acting, 
talking, mood, emotion, interests, 
ideation. The traits were compiled 
from interview records and presum- 
ably cover a large proportion of all 
characteristics required to describe a 
patient's personality and symptoms. 
Three ratings are used: a plus for a 
positive degree of a trait, no mark at 
all when a trait is present to a normal 
degree or does not pertain to the sub- 
ject, and a minus sign for a negative 
degree of the trait. Ward behavior, 
interview data, and social history 
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may be taken into account in rating. 
The check list has been used success- 
fully to identify traits related to im- 
proved adjustment of lobotomized 
patients (16). 

A mental health check list entitled 
The Pattern for Living has been con- 
structed and reported by Conrad (3). 
The check list is designed to measure 
positive mental health, social con- 
formity, and pathology. It is for use 
by trained personnel in appraising 
persons applying for and receiving 
outpatient psychotherapy. Every 
item regarded as true is checked plus, 
false items are checked minus, while 
the remainder may be scored by a 
question mark. Trends are indicated 
in a separate column. Of the 45 
items, 16 are concerned with positive 
mental health, 12 with conformity, 
and 17 with pathology. Evidence is 
presented that patients with high 
positive mental health scores tend to 
stay in therapy (3). 

A Guided Clinical Interview Anal- 
ysis for use in connection with struc- 
tured clinical interviews has been 
reported by Abt (1). Eight scales 
descriptive of attitude toward par- 
ents, attitude toward siblings, atti- 

tude toward childhood, attitude 
toward people in general, and atti- 
tude toward sex make up the Anal- 
ysis. Each subcategory of a scale is 
ratable on a 5-point scale. Agreement 
between raters for recorded interview 
material is reported to be high. A 
scale similar to the Abt form as to 
content, for use in evaluating adult 
outpatients receiving psychotherapy, 
is being developed by Morse (15). 
The six major areas measured are ac- 
cessibility to therapy, occupational 
and school adjustment, social adjust- 
ment, sexual adjustment, family ad- 
justment, and symptomatology. 
Most of the 40 items contain four 
brief descriptions to characterize 
grades of behavior on the scale con- 
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which represent the usual variation 


tinuum, Thus far neither reliab 
nor validity data are available. 

The Psychiatric Rating Scales 
veloped by Malamud et al. (13), 
the Worcester Hospital in Massae 
setts, represent yet another 
form for the quantitative record 
of psychiatric clinical findings @ 
the changes that occur in them durin 
the course of illness. The scale co 
sists of 19 “functions” divided i 
three major groups. The first 
functions comprise behavior item 
such as sexuality and sleep, whichd 
pend upon continuous observati 
by ward personnel during the 
hours preceding the psychiatric m 
ings. The third group consists 
eight functions that are evaluat 
during the interview, anl require , 
bal reactions from the patients thet 
selves. Associations, memory, à! 
thought processes are examples. 
these eight functions. ; 

Each of the 19 scales extends to th 
left and to the right from a cent 
base line consisting of two term 


in the particular function : 
within normal limits. On both sid 
of the base line are indicated t 
range of deviation in terms of 
gressive degrees of pathology. O 
left side of the scale are those d 
tions which are directed toward 
outside (centrifugal). To the 
the base line are those centrip 
internally directed deviations 
function. Steps in the functio! 
identified by three psychiatric t 
to the left of the base line and 
to the right. Ratings made for € 
half of a function may range 
to 6. The Feeling function, fore 
ple, is marked by the followt 
quence of terms: panic, @ 
guilt, tense-irritable, hyperse 
hyposensitive, phlegmatic, dull, 
thetic. A patient's total score 
sists of the sum of his scores 
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of the 19 functions. A correlation of 
.92 was secured between 100 paired 
independent ratings made by two 
psychiatrists on 26 patients. Applica- 
tion of the Psychiatric Rating Scale 
to agitated depressives (12) indicates 
a good correlation with changes in 
the clinical picture resulting from 
electroshock therapy. The scale has 
also been used to record changes re- 
sulting from prefrontal lobotomy. 

A criticism that may fairly be di- 
rected at the Psychiatric Rating 
Scales is that the terms used to de- 
scribe the 19 functions are nowhere 
defined. This would suggest that 
ratings on individual functions are 
less reliable than the total scores. In 
total scores, differences between 
raters are cancelled out in the process 
of summing. Lockwood (5) reports 
that the evaluations of his psychiatric 
raters as reflected on individual 
scales were not sufficiently reliable for 
use as criteria for clinical improve- 
ment. In a number of the scales 
several variables seem to have been 
forced into a single bipolar contin- 
uum. Thus, instead of one rating 
there should be two on such functions 
as Sexuality. It would have been 
preferable to determine empirically in 
advance whether or not presumably 
antithetical forms of behavior were 
mutually exclusive. 

The Elgin Behavior Rating Scale 
Revised has been developed by 
Wittman and Hills (29) for the pur- 
ites of describing psychiatric pa- 
TS insixareas of behavior. A rating 
E hanna by selecting the de- 

ptive sentence listed under a cate- 
gory which is most applicable to the 
mt The scales are graded from 
: a poo to “very good." A weight 
4s given to the most deviant 

h or; intermediate steps are as- 
Signed weights from 1 to Vip sect 
9. Vior is covered by seven scales 
Ctiptive of physical appearance, 
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physical condition, appetite, and the 
like. The Social Behavior area con- 
sists of six scales descriptive of such 
aspects as conversation, cooperation, 
and sex behavior. There are eight 
scales to describe Mental Behavior 
such as orientation, insight, and af- 
fective response. Under the rubric of 
Psychotic and Neurotic Behavior are 
three global scales descriptive of af- 
fective exaggeration, paranoid pro- 
jection, and schizoid withdrawal. 
Neurotic Behavior and Anti-Social 
Behavior are separately rated on two 
scales. No data with regard to inter- 
rater agreement are reported. Most 
of the scales contain several variables 
for rating. The orientation scale, for 
example, includes items relative to 
disorientation as to time, place, and 
person. There are actually 63 sepa- 
rate variables available for rating. 
Although no data are provided by the 
authors, the total score would proba- 
bly represent a useful measure of | 
over-all severity of illness. 

A series of check lists designed to 
describe patient character, tempera- 
ment, and intellectual capacity have 
also been developed by Wittman (29) 
and her associates. The check list of 
Fundamental Temperament Reac- 
tions postulate three bipolar com- 
ponents. The first component, affec- 
tive exaggeration, extends from 
manic expansion to depressive com- 
striction. Aggressive ascendance and 
defensive passivity mark the two 
ends of a continuum of paranoid com- 
pensation. The schizoid withdrawal 
component extends from heboid re- 
gression to simple withdrawal. Two 
descriptive elements identify manic 
expansion and a similar number are 
used to describe depressive constric- 
tion. The rater may check as few or 
as many elements as he observes; the 
total number of checks provides a 
total score. Wittman has also con- 
structed check lists for Temperament 
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Deficiencies, Anxiety Reactions, 
Character Deficiencies, Exaggerated 
Reaction Types, Addictive Reaction 
Types, Disorders of Intellectual Ca- 
pacity, Constitutional Intelligence 
Disorders (Amentia), and Acquired 
Intelligence Loss (Dementia). 

Wittenborn (23) has reported a 
rating-scale procedure for the evalua- 
tion of mental hospital patients as 
one step in a broad program directed 
at the development of a quantified 
method for multiple pyschiatric diag- 
noses. The procedure was devised to 
permit a psychologist, psychiatrist, 
nurse, or other competent observer 
to (a) rate the currently discernible 
symptoms of a psychiatric hospital 
patient, (b) score these ratings, and 
(c) prepare a profile which would in- 
dicate to what extent the patient’s 
pattern of symptoms resembled the 
symptom patterns found among psy- 
chiatric hospital patients generally. 
The rating schedule consists of 55 dif- 
ferent, unlabeled scales presented 
sequentially in a random manner. 
The scales were selected to represent 
a fairly adequate sample of important 
symptoms that characterize hospital- 
ized patients. They were designed to 
demand a minimum of interpretation 
and experience from the observer, 
and to yield judgments which are 
relatively unbiased by the rater’s 
particular theory or point of view. 
The directions require that the rater 
check the most pathological condition 
or level of behavior observed during 
the period studied and that every 
scale be checked for every patient. 
This last procedure undoubtedly 
facilitates correlational ^ studies. 
However, there is a real question as 
to how much error is introduced if 
judgments are forced as in the case of 
the mute patient or the patient who 
is evasive or irrelevant in his speech. 
A mute patient, for example, must be 
judged as to rate of change of ideas, 
insight, or rate of speech. 
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Most of the scales consist of foi 
elements; a lesser number contai 
three or five elements. The ra 
process, which consists of encircling 
one element for every scale, requir 
about 10 or 15 minutes. 

In an effort to determine the exist 
ence of clusters on patterns of symp 
toms and the influence of such 
acteristics of the sample as age, sex, 
or organic brain damage on these pi 
terns, Wittenborn (24, 25) has 
pleted a series of factorial analyses of 
his rating schedule. The nine factor 
or psychiatric syndromes which have 
been repeatedly identified are: Acute 
Anxiety, Conversion Hysteria, Man 
State, Depressed State, Schizoph 
ic Excitement, Paranoid Condi 
Paranoid Schizophrenic, Hebephi 
ic Schizophrenic, and Phobic-Ci 
pulsive. Scoring weights for 
symptom scales and norms for ti 
muting cluster scores into standard 
cluster scores are available for the. 
Descriptive Scales for Rating Cur- 
rently Discernible Psychopatholos 

The Multidimensional Scale 
Rating Psychiatric Patients 
(MSRPP) consists of two sets OF 
brief, relatively objective rating 
scales for the description of the be 
havior, symptoms, complaints, andi 
inferred motivation of psychiatric 
patients. One form or set is for out- 
patient use and the other for hospi 
use. These schedules represent 8 — 
systematic effort to (a) develop 4 
quantified record or description © 
mentally ill patients that could be 
used to measure change or improve". 
ment, and (b) isolate and identity 
underlying unitary variables. ni 
form for outpatient use, develo A 
Lorr, Jenkins, Holsopple, and Rubin 
stein (8), consists of 49 unlabeled, 
randomly presented, 4- and 6-po 
graphic rating scales for descri! 
outpatients as seen in diagnostic 
therapeutic interview. It is inten 
only for use by psychologists or 97^ 
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chiatrists. The scales, calling for 
judgments based on directly observa- 
ble manifest behavior, are grouped 
together, as are those which require 
inferences. Included are scales de- 
scriptive of personality traits such as 
emotional responsiveness, complaints 
such as headaches, and symptoms 
such as compulsive behavior. The 
schedule provides scores for 16 fac- 
tors identified on an original set of 73 
(9). Tentative norms and standard 
scores for profiling individual patient 
records are available. The factors 
measured by this form have been 
labeled Hostility, Reality Distortion, 
Obsessive-Compulsive Reaction, Sex 
Conflict, Gastro-Intestinal Reaction, 
Cardiorespiratory Reaction, Anxiety- 
Tension, Anxious Depression, Emo- 
tional Responsiveness, Adaptability, 
Sense of Personal Adequacy, Vigor- 
ous Interest, Conscientiousness, In- 
dependent Maturity, Goal-Directed 
Motivation, and Prudence. 

The MSRPP form for hospital use 
represents a revision of the Northport 
Record, which was initially con- 
structed and developed by Lorr, 
Singer, and Zobel (7). It consists of 
30 unlabeled graphic rating scales 
presented in a random order. The 
schedule was designed to measure the 
major symptoms characteristic of rec- 
ognized syndromes of the various 
Psychoses and behavior readily ob- 
Servable in a routine diagnostic inter- 
View or on the ward by nurses and 
Psychiatric aides. A rating is made 
ind encircling the entry which is most 
ul or representative of the pa- 

ent during. the period observed. 
a pe is no basis for rating a 
fria on a particular scale, an un- 

4 e category is encircled. 
rin te hospital form of the MSRPP 

Vides tentative norms, based on 

n Patients from four psychiatric 
is Lu for 12 factors or syndromes 

Sured. These factors were identi- 

in à multiple-factor analysis of 
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55 of the original 81-item Northport 
Record. Standard score measures are 
available for profiling a patient on 
the following factors: Manic Excite- 
ment, Retarded Depression, Anxious 
Depression, Perceptual Distortion, 
Conceptual and Thinking Disorgani- 
zation, Paranoid Suspicion, Grandi- 
ose Expansiveness, Schizophrenic Ex- 
citement, Disorientation, With- 
drawal, Hostile Aggressiveness, and 
Activity Level. The last three of 
these factors are behavior parameters . 
observed on the ward. 

The sum of the absolute deviations 
from a “normal” pattern provides an 
over-all index of severity of illness 
which has been found to reflect 
change resulting from lobotomy. 
Seven type patterns similar to con- 
ventional diagnostic description are 
also available for use as an aid in 
diagnosis. 


DISCUSSION 


The development and use of rating 
scales and check lists for the system- 
atic recording of clinical judgments 
of manifest behavior and inferred at- 
titudes and needs appear to represent 
an important advance for the clini- 
cian and the research worker. 

There seems to be no doubt that 
the interview is here to stay even 
though some critics, particularly the 
psychometricians, would have it re- 
placed by other and presumably more 
rigorous procedures. The problem 
becomes one of developing controlled 
interview patterns as suggested by 
Zubin (31) and of objectively record- 
ing what the trained clinician can 
validly or reliably observe or infer. 
While the new techniques for sound 
recording of interviews are unques- 
tionably important for evaluation, 
they are not substitutes for the clini- 
cian nor do they provide any com- 
plete basis for the analysis of the in- 
terview. Visual as well as auditory 
cues provide important data for an 
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appraisal. We are simply saying, in 
brief, that the clinician can contrib- 
ute toward the description of his 
patient and the prediction of his 
future behavior. The rating form 
provides a framework for quantifying 
his judgments, for jogging his 
memory, and for minimizing "halo" 
bias. 

The rating schedule also offers a 
common conceptual framework for 
the clinician regardless of the exami- 
nation procedure used. Clinical judg- 
ments derived from an analysis of the 
Rorschach test, the TAT, or a sen- 
tence completion form may be re- 
corded in objectified form on the rat- 
ing scales. Ratings can be useful in 
defining and clarifying areas of agree- 
ment and disagreement. Clinicians 
differing in theoretical orientation 
can find a common ground when a 
concept characteristic of an individ- 
ual is stated simply, in graded form. 
When defined in simple understanda- 
ble terms, many presently elusive and 
amorphous variables can be checked 
for reliability and related to a larger 
domain of objectively expressed con- 
cepts. Conceptual formulations often 
loosely used, such as sexual identifica- 
tion or ego strength, can be pinned 
down for closer scrutiny and valida- 
tion. : 

In any field where agreement on 
basic variables is lacking, factor anal- 
ysis is a powerful tool for resolution 
of complex concepts into simpler ele- 
ments and for the identification of 
underlying parameters, ‘ The confu- 
sion in, and duplication of, vocabu- 
laries for describing personality and 
psychodynamics are especially nota- 
ble. However, Moore, Wittenborn, 
and others (14, 25) have shown that 
rating scales and check lists can be 
utilized fruitfully for the isolation 
and identification of psychopatho- 
logical syndromes and categories. 
Investigators concerned with the iso- 
lation of primary factors in percep- 
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tion and cognition have 
recommended a battery of 
represent each of the better defin 
factors in future factorial studies. _ 
the absence of more objective m 
ures of personality and psychopath 
ology, it may be useful to utilize simi: 
larly, as reference variables, standi 
sets of rating scales in further] 
torial investigations. 


SUMMARY 


The purpose of this review has bet 
to examine and report on rating sc 
and check lists designed to d 
psychiatric patients in the inte 
and on the ward that have appeare 
during the past ten years. A ha 
dozen scales and check lists suitabl 
for use by nurses and psychiatric 
aides is reported in the literature. O 
these devices the Hospital A 
ment Scale has been most ca 
developed and seems to be usable by 
any psychiatric aide who can 
For more precise measurement 
change the Gardner Behavior C 
and the Northampton Activity 
ing Scale may be preferable. 

Among those designed for use 
psychologists and psychiatrists to de 
scribe psychotic behavior and s 
tomology, the Wittenborn De 
tive Scales and the Multidimen 
Scales have been most intensh 
analyzed and developed. The 
Prognostic Scales are at present 
most useful for predicting improvt 
ment although difficult to use because 
of the practical problem of secu 
reliable data on the patient's 
history. 

The rating schedule offers con 
able promise as a procedure for q! 
tifying the interview, for isola! 
basic psychopathological param 
and for generally providing a con 
tual basis against which clini 
varying persuasions and training 
find a common ground. A 
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RECENT STUDIES OF SIMPLE REACTION TIME! 


WARREN H. TEICHNER 
Aero- Medical Laboratory, Wright-Patterson Air Force Base 


In spite of the important role that 
the human reaction time study has 
played in the development of psycho- 
logical science and the tremendous 
amount of research effort expended 
in its behalf, there are still large gaps 
in our knowledge of the empirical re- 
lationships in which reaction time is 
involved. This report is an assess- 
ment of the present scientific status 
of the topic based primarily on the 
experimental literature of the last 
twenty years. Previous reviews were 
presented by Woodworth (161) in 
1938 and by Johnson (82) in 1923. 
In addition, Forlano, Barmack, and 
Coakley (50) have reviewed the ef- 
fects of ambient and body tempera- 
ture on both simple and choice reac- 
tion time and Finan, Finan, and 
Hartson (46) have briefly sum- 
marized the use of reaction time 
Scores as measures of performance 
decrement. 

First, it should be noted that there 
are several kinds of reaction time ex- 
periments (8, 161). Because of the 


1 This paper is a revision and extension of 
The Simple Reaction Time, a Review with 
Reference to Air Force Equipment, Wright 
Air Development Center Technical Note 
WCRD 52-47, August 1952, which was written 
when the author was on the staff of the Psy- 
chology Branch, Aero-Medical Laboratory, 
Wright Air Development Center. The writer 
is now with the Human Resources Branch, 
Natick QM Research & Development Labora- 
tory, Natick, Massachusetts. 

2 Thanks are due to many people for their 
comments, and in particular to Mr. Darwin 
Hunt and Dr. Davis Howes of the Aero- 
Medical Laboratory, and Dr. Austin Henschel 
of the Natick QM Research and Development 
Laboratory. 
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tremendous literature involved, the 
present discussion will be restricted to 
the simple reaction time (RT), which 
is the time interval between the onset 
of the stimulus and the initiation of 
the response under the condition that 
S has been instructed to respond as 
rapidly as possible. In order of pres- 
entation this review will consider 
the effects on the RT of stimulus and 
receptor conditions, central and 
motor factors, and certain special 
conditions such as the effects of low 
ambient temperature, loss of sleep, 
etc. 

Next, the complexity of the meas- 
ure should be recognized. After the 
onset of the stimulus there is a lag, or 
latent period, during which the re- 
ceptor process is initiated and builds 
up to a maximum (25, 56, 119, 120, 
140, 141). This is followed by 4 
second lag involving central trans- 
mission of the sensory impulses to the 
motor fibers, and finally, there is à 
time delay involved in the contrac- 
tion of the muscles (33, 34, 47, 67, 
127) and the beginning of the move 
ment of the responding member. Any 
of the factors which affect any of 
these processes will obviously also 
affect the measured RT within which 
they are present. Since most RT 
studies deal only with the over-all 
measure of time, the present discus- 
sion will be confined to this measure. 
However, a discussion of sensory 
latent periods relevant to the visual 
RT may be found in Arnold am 
Tinker (2) and in Strughold (140, 
141), to the auditory RT in Chochell 
(24, 25), to the pain RT in Pattle and 
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Weddell (117), and to the thermal 
RT in Wright (162). Piéron (120) 
summarizes the topic in considerable 
detail. 


SrmmuLus-RECEPTOR FACTORS 


RT as a Function of the Sense Modal- 
ity Stimulated 


It is a common assumption based 
on the neuro-anatomical differences 
existing among the various receptor 
systems that the time of reaction 
varies according to the sense mo- 
dality stimulated. Textbooks usually 
contain comparisons between the 
RT's obtained by various kinds of 
stimulation, frequently presenting 
lists in which RT is ranked according 
to the senses. However, little atten- 
tion appears to have been given to 
the logic of measurement involved in 
making such comparisons. For exam- 
ple, in order to say that the auditory 
RT is shorter than the visual one, as 
is usually done, the two types of 
stimulation must be compared on the 
same scale. The scales that have been 
employed are, unfortunately, scales 
of subjective intensity and these 
cannot be considered comparable 
En one sense modality to another. 

his argument, i.e., that the attribute 
terms of one psychophysical dimen- 
Sion logically cannot be projected to 
another such dimension, represents 
what is usually thought of as an ad- 
vance in the logical foundation of 
ee. and will be remembered 
iind invoked considerable dis- 
Ei Although the problem was 
i ved with regard to the compar- 
ng of sensations, it seems to have 

een ignored in dealing with the RT. 
E. this reason, the conclusion that 
st be drawn is that there is no evi- 
nce available that indicates whether 


* For a compl 
eti i 
see Boring (10). e statement of this problem 
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or not the RT varies according to the 
receptor system stimulated. 

The experimental literature availa- 
ble does allow one kind of meaningful 
conclusion regarding this matter. 
Where studies have been done com- 
paring a specific intensity of, say, 
sound with a specific intensity of, 
say, light, it should be possible to 
decide whether the RT’s for those 
specific values, and those only, are 
shorter for the one sense than for the 
other. Unfortunately, most studies 
making such comparisons either re- 
port no intensity values at all, or 
they report them in arbitrary units 
with no means of reference. 

It is possible, of course, to specu- 
late. The literature concerned with 
visual and auditory RT’s is almost 
universal in reporting faster RT’s to 
the sound stimuli than to other 
stimuli. Most studies have also found 
faster RT's to tactual than to visual 
stimuli. Robinson (125), for exam- 
ple, presented a summary table of 
eight of the older investigations in 
which the RT’s for vision, hearing, 
and touch were compared. If medi- 
ans are calculated from this table, 
the RT for audition is found to be 
0.142 sec., for touch 0.155 sec., and 
for vision 0.194 sec. In all eight of the 
experiments, the auditory RT was 
consistent in being faster than the 
visual one. But in four of the eight, 
the tactual RT was faster than the 
auditory one and in the other four 
the opposite result was obtained. 
Todd (146), and in addition more 
recent studies (4, 16, 38, 48, 153, 
154), concur in finding shorter RT’s 
to sound than to light. One other 
study (Moldenhauer) reviewed by 
Woodworth (161) found the auditory 
RT faster than the tactual one. 
Lanier (92), however, in a study of 
the effect of training on auditory, 
visual, and tactual RT’s found the 
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tactual RT shortest for trained Ss 
with the auditory and visual RT's 
being approximately equal. With 
untrained Ss, on the other hand, the 
auditory RT was shortest with visual 
and tactual times being equal. 

It is possible that the speed of 
reaction with respect to sensory 
modality may depend on some sort of 
speed or reaction factor which deter- 
mines the kind of stimulus to which 
an individual will exhibit the shortest 
RT. Wells, Kelley, and Murphy 
(153, 154) studied the ratio of the 
median RT to light to the median 
RT to sourid and found a light-sound 
ratio of 1.15 for 11 untrained Ss. 
Two trained Ss exhibited ratios of 
1.34 and 1.45. They also found a 
correlation of —.52 between the ra- 
tios and the median RT to sound in 
the untrained group. From this they 
concluded that Ss who have a rela- 
tively fast RT to sound have a rela- 
tively slow RT to light, and vice 
versa. The ratios obtained agree 
fairly well with those recently re- 
ported by Canfield, Comrey, and 
Wilson (16). However, the correla- 
tion of —.52 has little meaning since 
it is logically possible to obtain a 
negative correlation between a ratio 
and its denominator even when the 
direct correlation between the numer- 
ator and the denominator is positive. 
In fact a negative correlation merely 
indicates that the ratio is greater 
than 1.00. More direct comparisons 
by Forbes (48) and by Lanier (92) 
have revealed positive correlations of 
.48 and .90 respectively between re- 
sponse to light and to sound. 

From a consideration of chemical, 
temperature, pressure, or electrical 
stimuli applied directly to the skin to 
elicit pain response, Woodworth (161) 
concluded that the slowest RT is that 
based on painful stimulation. Recent 
studies have also recorded pain RT’s 
to radiant heat (e.g., 61, 139, 156) 
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and radiant cold (35). It is neces- 
sary, however, to consider not only 
the measurement problems involved 
but also the logical and technical dif- 
ficulties involved in the measurement 
of pain (37, 61). 

Wright (162) investigated the RT’s 
associated with cutaneous sensations 
of warmth elicited by visible and in- 
frared radiation. In 22 out of 39 men 
tested, RT was faster to stimulation 
on the back of the hand than it was 
on the palm. In 17 out of 27 others 
who were tested, stimulation in the 
epigastrium produced faster RT's 
than stimulation in the interscapular 
region. In 78 Ss the RT to light was 
found to vary from about 0.33 sec. for 
a very intense sensation to about 20 
sec. at threshold. One interesting re- 
sult was that psychophysical func- 
tions obtained with RT measures 
showed a qualitative similarity to vis- 
ual functions of intensity, duration, 
and area. 

A little work has also been done re- 
cently on proprioceptive RT's. Cher- 
nikoff and Taylor (21) measured the 
speed of reaction to the sudden falling 
of the S’s arm. When the response 
measure was the time of release of a 
telegraph key, no differences were ob- 
tained between auditory, tactual, and 
kinesthetic RT’s. However, when the 
response was the stopping of the arm 
movement, RT was considerably 
shorter than the key-release type of 
reaction regardless of whether the lat- 
ter was to an auditory, tactual, Or 
kinesthetic stimulus. Hick (72) and 
Vince (152) have also studied the 
RT's involved in making corrective 
movements in a pursuit task involv- 
ing both visual and proprioceptive 
components. y 

In a different type of propriocep 
tive study, Baxter and Travis 
measured the vestibular RT's of 31 
Ss to rotary motion of the body. The 
Ss were blindfolded, seated in a con- 
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stant-speed revolving chair, and in- 
structed to press a telegraph key 
upon detecting a change in motion. 
When the chair was moved from a 
stationary position, a mean RT of 
0.52 sec. was obtained; when the 
chair was in motion and the direction 
of motion was changed, a mean RT 
of 0.72 sec. was obtained. From this 
highly significant difference Baxter 
and Travis concluded that the RT to 
perception of motion from rest is 
faster than the RT to perception of 
change in direction of motion. 

The possible variation of RT with 
the sense modality used for stimula- 
tion is a question which has not been 
answered by the studies reviewed. 
As noted above, genuine comparisons 
require the use of common scales and 
such scales have not been employed. 
One possibility presently available is 
to determine the R T's on a statistical 
or probability basis and make com- 
parisons in terms of a probability of 
RT scale. The first step is to deter- 
mine empirically for each sensory mo- 
dality the relationship between RT 
and the intensity dimension used for 
that modality, i.e., the function 
P(RT)=f(1). This relationship is 
Presumably sigmoid. ‘Time of reac- 
tion (RT) can now be determined as 
a function of the probability of reac- 
tion for each case, i.e., the function, 
RT - g[P(RT)] can now be obtained. 
Since P(RT) is common for all mo- 
dalities, this function furnishes a le- 
Eitimate basis of comparison within 
the range, 0< P « 1.00. 


RT as a. Function of the Number of 
Sense Organs Stimulated 


ES. recent work has been done on 
E S eus bisensory stimulation. 
Ces in previous reviews Pof- 
igh BUT (121) reported the RT to 
K aot 0.015 sec. faster for each 
ee e three Ss he used when both 

were stimulated than when only 
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one was stimulated. This is hardly 
conclusive considering the size of his 
sample and the likely error of meas- 
urement. Similarly, Bliss (9) found 
the RT slightly shorter for binaural 
than for monaural stimulation. More 
recently, Smith (136) found faster 
RT's to the apparent movement of 
objects when viewed  binocularly 
than when viewed monocularly. 

Since both visual and auditory phe- 
nomena usually show differences ac- 
cording to whether the stimulation is 
mono- or bisensory, it is reasonable 
that RT's should show corresponding 
differences. The high expectation of 
these differences is probably the rea- 
son why experimenters have been so 
little motivated to provide demon- 
strations of them. This is unfortu- 
nate because neither Poffenberger's 
data nor those of Bliss should be con- 
sidered reliable enough to be suffi- 
cient. 

Simultaneous presentation of stim- 
uli to different senses has also been 
studied. Todd (146) presented light, 
sound, and electric shock singly and 
in simultaneous combinations and 
measured the speed of simple reac- 
tion to each. Combined stimuli in 
every case elicited faster RT's than 
the individual stimuli making up the 
combination. The RT to combined 
sound and light, for example, was not 
only faster than to light alone but 
also faster than to sound alone. The 
shortest RT’s were to the combina- 
tion of all three types of stimulus. 
On the other hand, successive stimula- 
tion of different sense organs pro- 
duced longer RT's than did stimula- 
tion of single sense organs. 


RT as a Function of Number of Re- 
ceptors Stimulated 


According to neurological princi- 
ples of summation, it would be ex- 
pected that the greater the number of 
receptors stimulated, the shorter the 
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latent period and, consequently, the 
shorter the time of reaction. There 
are two studies which support this 
hypothesis. Older data from Froe- 
berg (55) provide some support in the 
visual case. These data show that, as 
the retinal area stimulated is in- 
creased from three to 48 sq. mm., RT 
decreases from 0.195 sec. to 0.180 sec., 
the function appearing to be nega- 
tively accelerated and still decreasing 
at 48 sq. mm. Wright (162) reported 
a similar type of phenomenon for 
thermal RT's. 


RT as a Function of the Location of 
the Stimulation in the Visual Field 


The visual RT varies with the por- 
tion of the visual field stimulated, ac- 
cording to Poffenberger (121). This 
investigator stimulated at 3°, 10°, 
30°, and 45° away from the fovea and 
measured the increase in length of 
RT over the RT obtained from foveal 
stimulation. He found that RT in- 
creased in the temporal periphery 
from about 0.004 sec. at 3° to about 
0.024 sec. at 45°, and in the nasal pe- 
riphery from about 0.004 sec. to 
about 0.015 sec. Except for stimula- 
tion at 3°, increases in RT were con- 
sistently greater for temporal as com- 
pared to nasal stimulation. 

Poffenberger's data suggest that 
RT is positively correlated with abil- 
ity to perceive shape since shape per- 
ception is greatest in the foveal area 
and decreases toward the periphery. 
RT seems to be correlated also with 
visual acuity (reading of numbers, 
letters, etc.) since this too decreases 
with distance away from the fovea 
and is greater in the nasal half. 
These relationships, although not re- 
cently confirmed, present some very 
interesting possibilities for the field of 
visual measurement and suggest, 
among other things, speed-of-seeing 
techniques for the testing of visual 
acuity. Such techniques have had 
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some use in the measurement of acu- 
ity as related to the amount of illumi- 
nation (26, 45, 96). 

Longer RT’s with stimulation in ` 
the peripheral area would not be ex- 
pected in the dark-adapted eye, pe- 
ripheral sensitivity to weak light be- 
ing greater under this condition. 
Data relevant to this last hypothesis 
were obtained by Lemmon and Gei- 
singer (94) who measured the RT's of 
14 Ss at the fovea and 45? away. 
Under light-adapted conditions they 
found the RT significantly longer in 
the periphery, which supports Pof- - 
fenberger's (121) results. When the 
eye was dark-adapted, they found the 
average RT of their Ss slightly shorter 
in the periphery, which is in accord 
with the hypothesis. This result, 
however, was not statistically signifi- 
cant. 


RT as a Function of the Intensity of 
the Stimulus 


Considerable research and theo- 
retical effort have been expended on 
the relationship between RT and 
stimulus intensity. Early studies (7, 
20, 26, 45, 55, 56) all agree that the 
visual RT becomes shorter as the in- 
tensity of light is increased. More re- 
cent investigations (32, 96, 137, 138) 
concur. [n spite of earlier controver- 
sies, there is little doubt that the re- 
lationship is a nonlinear one, not only 
in the case of the visual RT but also 
for auditory (24, 25, 118), gustatory 
(118), thermal (162), and pain (35, 
73, 120) RT's. Attempts have been 
made to fit the intensity data into 
mathematical, theoretical frame- 
works, with exponential, hyperbolic, 
and parabolic functions all being u 
more or less successfully on the same 
sets of data (78, 91, 118, 124). 

In all the intensity studies cited, the 
intensity of the stimulus was vari 
and the speed of reaction measure? 
when the receptor was in a “norm 


SIMPLE REACTION TIME 


condition. In a related, but some- 
what different type of study, Hov- 
land (76) carefully investigated the 
effect on the RT to a test stimulus of 
250 foot-candles of having the eye 
previously adapted to lights ranging 
from zero to 200 foot-candles. As 
might beexpected, RT becameshorter 
as the difference between adaptation 
brightness and stimulus brightness in- 
creased. This result is not altogether 
clear, however, since Lemmon and 
Geisinger (94), who also used a very 
bright test stimulus, found shorter 
RT's with the light-adapted than 
with the dark-adapted eye. 

In addition to varying theintensity 
of the stimulus or of the adapting 
stimulus, it is also possible to vary the 
magnitude of change in the intensity 
of on-going stimulation. The RT has 
been studied in this way only in vi- 
sion, Steinman (137) found that the 
RT became shorter as the relative 
magnitude of the change in intensity 
was increased. The relationship held 
only up to a limit, however, after 
which the RT began to increase 
again. This suggests that the func- 
tion is not monotonic, but rather has 
an optimum at some moderate 
amount of change in intensity. If this 
conclusion is supported, it may pro- 
vide an explanation for the discrep- 
ancy between Hovland’s (76) finding 
a consistent decrease of visual RT 
with greater differences between adap- 
tation intensity and test stimulus in- 
tensity and the opposing result ob- 
tained by Lemmon and Geisinger 
099; ie., the change in stimulus in- 
€nsity in the latter study may have 
ME or beyond that point in the 
vis sai where RT begins to become 

N T. Some support for this hy- 
Pothesis is to be found in a study re- 
Ported by Johnson (81). In this ex- 
rent Johnson darkened one half 

à photometric field by about 4.6 
Per cent and compared the RT's 
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when the surround was 2.25 times as 
bright as the test field, 0.75 times as 
bright as the test field, or too dark to 
be measured. The test field itself had 
a brightness of 7.8 millilamberts and 
was of foveal dimensions. Johnson re- 
ports that the slowest RT was that 
elicited under the darkest condition, 
the next slowest under the brightest 
condition, and the fastest RT was ob- 
tained under the moderately bright 
surround. All differences were statis- 
tically significant. These results, 
along with those of Steinman, indi- 
cate the likelihood of an optimum in- 
tensity change and again suggest an 
explanation for the otherwise contra- 
dictory results of Lemmon and Gei- 
singer. 

RT's have been used effectively to 
study the effect of illumination on 
visual acuity. Luckiesh (96, p. 131) 
and Cobb (26) both found that speed 
of vision (RT) increases rapidly with 
increases in illumination up to about 
18 to 20 foot-lamberts, after which 
further increases in illumination have 
no significant effect. Data from Fer- 
ree and Rand (45) indicate that speed 
of seeing is also a function of size of 
test object. In this study the limit of 
speed reached a maximum at approx- 
imately 25 foot-candles for relatively 
large test objects (3, 4.2, and 5.2 min- 
utes of visual angle) and at approxi- 
mately 45 foot-candles for relatively 
small test objects (1 and 2 minutes of 
visual angle). 


RT as an Index or Measure of Sensa- 
tion 

The study of the effect of changes 
in magnitude of stimulus intensity on 
RT suggests the possibility of apply- 
ing RT measures to psychophysical 
problems. The use of choice reaction 
times in what has been called the 
method of judgment time (17, 40, 
144) is not new, of course. Although 
the use of the RT as a psychophysical 
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measure has always been implicit in 
the design of experiments involving 
visual and auditory thresholds, it has 
only recently had serious employ- 
ment in this regard. 

Steinman (137) studied the ade- 
quacy of the RT to change in bright- 
ness as a psychophysical method. As 
discussed above, he found that RT 
decreased as a (seemingly hyper- 
bolic) function of the magnitude of 
change. With a constant stimulus- 
ratio this relationship was maintained 
up to the higher intensity levels where 
a reversal occurred. Although he at- 
tributed this reversal to an adapta- 
tion effect, other hypotheses are pos- 
sible, as was indicated in the discus- 
sion of the effect of intensity. 

Steinman (137, 138) also observed 
that the RT was faster to a decre- 
mental change in magnitude than it 
was for an objectively equal incre- 
ment of change. This places a restric- 
tion on the method, but it is a restric- 
tion not without parallel in the 
standard psychophysical techniques. 
In any case, Steinman was able to 
plot RT functions which were in close 
agreement with similar functions ob- 
tained by more customary proce- 
dures, and for this reason he con- 
cluded that the method is adequate 
for securing équal perceptibility con- 
tours, threshold measurements, etc. 

Other similar studies have been 
done, most of them quite recently. 
Galifret and Piéron (57) were also 
successful in obtaining visual func- 
tions based on differences among 
RT’s. Chocholle (25) discusses the 
problem relevant to the psychophysi- 
ology of hearing; Wertheimer (156) 
suggests using the RT for obtaining 
radiant heat pain thresholds. Essen- 
tially this has been done both for the 
determination of the radiant heat 
pain threshold (61, 139) and the radi- 
ant cold pain threshold (35). Wright 
(162), furthermore, has obtained the 
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Weber type of function by usi 
taneous RT's as an index of seg 
tions of warmth produced by lig 
radiation. 


RT as a Function of the Duration. 
the Stimulus 


It is difficult to see why the d 
tion of the stimulus should influ 
the RT to the onset of a suprathres 
old stimulus unless some type of 
mation of intensity hypothesis if 
vanced. Nevertheless, there is ! 
suggestion in the literature thats 
ulus duration does have an effect.” 

Froeberg (55) varied visual stimuli 
by equal geometric intervals betwee 
0.003 sec. and 0.048 sec. Within thi 
range he found that the longest dut 
tions produced the shortest RT’ 
function (of the geometric int 
being linear. 

Wells (155) varied the durati 
both of sound and of light stimuli 
For an auditory stimulus he used the 
sound of an electric buzzer with dur 
tions of 0.007, 0.036, 0.051, 0.076, à 
0.106 sec. Two Ss gave 200 respo! 
under each duration. The result 
dicated that RT was a linear funi 
of the logarithm of the dura’ 
the stimulus, which is in agree 
with Froeberg’s visual data. Bui 
though the form of the relation 
was the same as that obtained. 
Froeberg, the slope was in the opposte 
direction. To study the effect € 
duration in the visual case, 
used a constant intensity stim 
(brightness of 0.12 millilamberts, 
five durations ranging between 04 
and 1.00 sec. In this experimen! 
responded to the onset of the 
In a second experiment perfori 
with the same Ss, response was mae 
to the cessation of the light. : 
durations of light were used 
this time ranging between 0.010 
1.00 sec. The results differed | 
those of Froeberg in that they ie 
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cated that there is an optimal dura- 
tion and that this duration varies 
from individual to individual. What- 
ever the individual optimum, RT 
tended to become longer with devi- 
ations from it. The range of optima 
for ten Ss was between 0.025 and 
0.066 sec. However, large variations 
were found not only for the optima 
among Ss, but in the optima between 
stimulus onset and cessation even for 
individual Ss; i.e., the optimum was 
usually not nearly the same for a 
single S under the two conditions. 
Brogden and his students (21, 22, 
62, 63) recently performed a series of 
experiments in which they compared 
the RT to auditory stimuli of fixed 
duration with the RT for response- 
terminated durations. They con- 
cluded that the primary variable is 
stimulus duration and that response 
termination merely acts on this fac- 
tor. However, they also found that 
for longer sound durations (400-2000 
msec.) the RT to the response-termi- 
nated stimulus was slightly, but sig- 
nificantly, shorter than to the fixed 
duration stimulus. For the shorter 
durations (100-200 msec.) there were 
no differences. Scrutiny of their data 
leaves the impression that RT in- 
creased in general as stimulus dura- 
tion was increased from 100 msec. to 
2c andas the duration became 
ks onger (up to 2000 msec.) RT de- 
pn again. This is in agreement 
ko ells's results with visual stim- 
di ut not with Wells's auditory 
a or with Froeberg's visual data. 
ol Co the conflicting results 
um “il ^ does seem as though 
Miki tu Sa ationship between stimu- 
M urs and RT. á The most 
Ee e expectation is that the 
~aon IS asymmetrical in nature, 
idly ir becoming Shorter) rap- 
aa =e duration is lengthened 
ereupon i dpa small time value, 
ses more gradually to 
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become asymptotic to some limit. 
The minimum of this curve is itself 
probably some function of stimulus 
intensity and length of foreperiod. 


RT as a Function of the Onset and of 
the Cessation of the Stimulus 


Another feature of the stimulus 
presentation which has occasioned 
some research, but about which little 
conclusive can be said, is the use of 
the onset and of the cessation of the 
stimulus as the signal for response. 
Both Holmes (74) and Jenkins (79) 
report shorter RT's to the cessation 
of light. On the other hand, Woodrow 
(160) found no differences in speed of 
simple reaction to stimulus onset or 
cessation for either auditory or visual 
stimuli. Wells (155) has shown that 
individual differences are very 
marked here, some Ss reacting more 
quickly to one or the other. Wood- 
worth (161) concludes that there is no 
difference between the two condi- 
tions, but the matter seems far from 
settled. Studies are required which 
control the subjective intensity of 
the stimulation at the time of onset 
and of cessation. The duration of the 
stimulus would appear to be an im- 
portant variable differentially de- 
termining the effectiveness of each 
condition. The readiness of S might 
be expected to be different in ac- 
cordance with the one to which he 
responds. 


CENTRAL AND Motor FACTORS 


Cerebral factors have not been 
studied in a way to make them ap- 
plicable to this discussion, although a 
little work has been done (135). 
Studies of speed of nerve conduction 
have theoretical significance but do 
not allow for generalization since 
they deal with the application of the 
stimulus directly to the nerve fiber 
and by-pass both sensory and motor 
factors. Certain psychological condi- 
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tions which might be of importance, 
e.g., personality factors (12, 151), the 
influence of incentives and punish- 
ments (80), etc., will not be dis- 
cussed owing to the limited research 
that has been done and the great 
complexities of the results. A few 
topics that might have been included 
here have been singled out for the 
next section, Special Factors. 

Regarding the motor system, most 
of the work that has been done on the 
responding mechanism is irrelevant 
in this context. Studies of the refrac- 
tory phase (e.g., 142) are of interest 
only in the case of successive quickly 
elicited responses (39, 123). Most of 
the relevant studies of this sort in- 
volve more complex types of reaction 
time measures. 


RT as a Function of Age and Sex 


The correlation of age and RT has 
received some attention. Jones (84) 
found an increase in speed of reaction 
to sound in boys ranging from 11 to 
14 years. According to this study no 
further increases should be expected 
beyond age 14. Atwell and Elbel (3), 
however, report continued small in- 
creases in speed up to 17, the oldest 
age used in their study. Bellis (6), 
in a study of ages ranging from 4 to 
60 years, observed a general shorten- 
ing of both visual and auditory RT's 
until age 30, after which latencies be- 
gan to grow longer. Even at 60, 
however, he found that RT is still 
faster than it is at 10. Miles (107), 
using three kinds of response (finger 
pressing, finger lifting, and right-foot 
lifting), tested 100 adults between 25 
and 87 years of age and found low 
(0.25 to 0.55) but significant positive 
correlations between age and speed of 
reaction. Elliot and Louttit (38) also 
report a low positive correlation. It 
would appear, therefore, that RT 
does covary with age. "Years old," 


WARREN H. TEICHNER 


however, may not be the best m 
ure of thefage factor. 
Regarding sex differences, 
and Louttit (38) in an investig 
of the braking reaction of men 
women in automobiles reported th: 
men react significantly more quickl 
Bellis’ (6) data, based on both at 
tory and visual stimuli, also fa’ 
males, especially in the age peri 
4-10 and 40-60. Seashore and 
shore (132), in studying the s 
various muscular responses (t 
and left hands, right and left 
jaws), found men significantly fa 
especially after practice. The weig 
of the evidence indicates that a 
difference favoring men does exi 
although this conclusion is har 
likely to end a perennial controver 


RT as a Function of Preparatory Si 


It is reasonable that RT will 
pend on the degree to which S | 
ready to respond. The use of a p 
paratory signal has been shown 
yield faster RT’s than the omissi 
of such a signal (163) or the 
pected presentation of a second s 
nal requiring a second reaction (12 
Consequently, most experiments. 
a ready signal. 

This factor of readiness or set seems 
to depend, among other things, of 
the length of time between the warn- 
ing or ready signal and the sti 
to which the response is made, 
is known as the foreperiod of r 
Breitweiser (11) found definite 
vidual differences in the length of 
optimum foreperiod and reporte 
range of optima between 1.0 and | 
sec. Telford (142), however, tak 
repeated measurements of 29 Ss, 
tained conflicting results when 
found that the average RT inc 
systematically from an optimu 
1.0 sec. to at least 4.0 sec. Telfi 
data also indicated that as the 
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period is reduced from the 1.0 sec. 
optimum to at least 0.5 sec., RT de- 
teriorates markedly. 

The study most frequently quoted 
with regard to the foreperiod is that 
of Woodrow (159). According to this 
study a 2.0-sec. foreperiod is opti- 
mum. In spite of the wide acceptance 
of this figure, there is a question as to 
the significance of Woodrow's results 
since the data were obtained from 
only three Ss and, as they are re- 
ported, do not allow for an estimation 
of the standard errors of the means. 
Freeman and Kendall (54) estimated 
that if Woodrow's standard errors 
(which were not reported) were of the 
same order as those obtained in their 
study, which used four Ss, the differ- 
ence between the 2-sec. and 8-sec. 
intervals obtained by Woodrow 
would have fallen between the 5 and 
10 per cent levels of confidence. Ordi- 
narily this would be sufficient to ac- 
cept the null hypothesis. 

Early studies (85, 158) demon- 
strated that preparatory sets are 
primarily muscular. Livingston (95) 
has shown that the amount of muscu- 
lar tension varies during the fore- 
period. Freeman (52, 53) and Free- 
man and Kendall (54) have investi- 
gated the influence of the amount, 
the locus, and the time of induction of 
muscular tension during the fore- 
Period on the RT. Under conditions 
heavy load (large induced muscu- 
ES uon), a longer foreperiod was 
light * be optimum than under 
RA de conditions. The locus of 
ii Mw was also found to be a 
Mant actor in determining the 
m n pregaratoge interval. Per- 
EE e most interesting finding 
with the = onea interval ges 
on ae of time prior to the 

ES wh at the tension was in- 
time 2 en the amount, locus, and 
nduction of muscular tension 
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are all considered together, they (54) 
found that the optimum foreperiod 
ranged between 4 and 8 sec., depend- 
ing on individual differences. As im- 
plied above, there is reason to believe 
that Woodrow's obtained optimum 
actually may be expressed best as a 
range between 2 and 8 sec. 

Another factor thought to influ- 
ence the readiness of the S to respond 
is whether he is set to respond to the 
stimulus ("sensory attitude") or 
whether he concentrates on the re- 
sponse (‘‘muscular attitude"). Al- 
though the weight of the evidence ap- 
pears to favor the latter, no con- 
trolled studies have been performed 
since Woodworth's (161) review. We 
are forced, therefore, to rest with the 
older, and for the most part less well- 
controlled, studies. These are thor- 
oughly discussed by Woodworth. 

The effect of instructions consti- 
tutes another great unknown which 
presumably determines the readiness 
of the S to respond. Perhaps the most 
serious obstacle to overcome in in- 
vestigations of this factor is not the 
ambiguity of language but an in- 
ability to know and to control S's selí- 
instructions. Some promising work 
has been done, however, in which in- 
structions were varied. Davis (33) 
compared the effect of instructions to 
respond with instructions to not-re- 
spond. Instructions to respond re- 
sulted in higher prestimulus tension 
levels in the responding arm, as 
measured by action potentials, and 
resulted in faster RT’s. Davis also 
noted that instructions to respond 
were made even more effective by 
using more intense stimulation. This, 
however, would be predicted from the 
relationship between RT and stimu- 
lus intensity and may, therefore, be 
independent of the effects of instruc- 
tions. : 

Moore (111) found that instruc- 
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tions have a greater influence on the 
variability of the RT than they do on 
the variability of the speed of move- 
ment of the responding part. When 
Ss were instructed to respond as 
quickly as possible with the fastest 
possible movement, the speed of 
movement remained practically con- 
stant but the speed of reaction varied 
considerably. 

Itisclear that a great many factors 
influence the optimum foreperiod. 
Other variables which should be 
studied in this relation are the in- 
tensity and duration of the stimulus. 
Of these, duration would appear es- 
pecially important since the durations 
of both ready signal and stimulus are 
confounded with the length of the 
foreperiod. No single value seems 
acceptable as the optimum since so 
many conditions are effective. Per- 
haps a more useful concept would be 
a range, the modal point of which 
would shift according to sensory 
modality, stimulus intensity and 
duration, nature of muscular tension, 
kind of instructions, etc. Presently 
available data indicate that such a 
range lies between approximately 1.5 
and 8 sec. 


RT as a Function of Body Position 


One question of much interest is 
whether or not the RT varies accord- 
ing to the position of the body. This 
has especial relevance today in con- 
nection with many military and in- 
dustrial problems. Reference to one 
such study has been found. Munnich 
(114) investigated the RT of several 
responses modeled after various as- 
pects of an airplane pilot's task. 
These RT's were studied with S in 
six different bodily positions: (a) 
seated normally with the back of the 
chair making a 120? angle with the 
seat; (b) stomach downward; (c) head 
downward; (d) back downward; (e) 
on right side; (f) on left side. Un- 
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fortunately, the report was not avail- 
able at the time of writing and the 
abstract does not describe the differ- 
ences, if any, which were obtained 
between positions. One interesting 
result, however, is described. It was 
found that RT increased in general 
after any change in position, but as 
the new position was maintained the 
RT's tended to return to the values 
normal for it. The generality of this 
conclusion is offset by the finding 
that some Ss improved their speed of 
reaction even after being changed to 
the most uncomfortable positions. 


RT as a Function of the Responding 
Member 


Of great theoretical and applied 
importance is the question of whether 
the RT varies with the body member 
which responds and with the type of 
motion involved. Regarding the lat- 
ter, most of the responses which have 
been used have consisted of the sim- 
ple release of a telegraph or similar 
key, or, less frequently, the depres- 
sion of such a key. Other types of 
hand motion have also been used and, 
in addition, such other body parts as 
the eye, mouth (both movement of 
the jaws and the verbal response), 
arm, leg, foot, or the entire body. 
Although the types of response are 
usually as simple as possible and the 
methods of measurement usually 
some sort of chronoscopic device, 
both of these vary somewhat from 
one experiment to another. Discus- 
sions of the experimental techniques 
used and of the effects of these tech- 
niques may be found in several places 
(8, 11, 161). Whether the equivocal 
results obtained from studies of this 
type are really due to differences 1? 
technique is hard to say. k 

Woodworth (161), in reviewing 
this problem, presented a number 0 
studies in which no differences were 
found between body members 
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Baxter's (4) data support these re- 
sults. On the other hand, Féré (44) 
found that the RT for the left hand 
was slower than that for the right, 
and that the sum of the RT's of each 
hand was larger when S tried to re- 
spond with both hands at once than 
when he moved them successively. 
Cattell (20) also observed that the 
RT for a particular task differs with 
the body member used, at least for 
the wrist, forearm, and shoulder. 
Hathaway (66) obtained results indi- 
cating that RT is longer for move- 
ment of the entire arm than it is for 
finger movement. Seashore and Sea- 
shore (132) reported that RT with 
` the left hand was slower than with 
the right, and, similarly, the left foot 
was slower than the right foot. They 
also found that speed of reaction of 
the jaws was greater than either the 
hands or feet. Chernikoff and Taylor 
(23) presented data showing that 
when the stimulus is the sudden dis- 
placement of S’s arm, his RT is 
shorter when he stops the movement 
than when he releases a telegraph 
key. Considering the contradictions 
in the various sets of data, it cer- 
tainly seems too early to draw even a 
tentative conclusion about the role 
played, if any, by the kind of re- 
sponding member. It is possible that 
the only real differences are due to 
differences in inertia of the respond- 
Ing musculatures. 

Woodworth (161) suggested that 
complex movements produce longer 
RT’s than do more simple ones. Not 
many studies were available to sup- 
aly this, but on the basis of those 
ew that were, Woodworth made two 
Specific hypotheses: (a) that guided 
(ined) movements have longer RT's 

an those made freely; (b) that the 
E is shorter for the initiation of a 
E wr than it is for the stopping 
tion or for the changing of its direc- 
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No studies were found bearing di- 
rectly on either of these hypotheses, 
but there are some results which are 
relevant to the general hypothesis 
that RT is somehow related to the 
response movement that follows it. 
Searle and Taylor (129) investigated 
the RT of corrective tracking move- 
ments made with a small knob to the 
displacement of a visual stimulus and 
found that this RT was not affected 
by the amount of knob movement re- 
quired to make a correction. Brown 
and Slater-Hammel (13) reported 
that the RT involved in making dis- 
crete arm movements in the hori- 
zontal plane is independent of the 
length and direction of the move- 
ment. Henry (71) obtained data 
which indicate that RT’s and move- 
ment times are not related. Except 
for the possible effect of the prepara- 
tory set, it is difficult to see why RT 
and consequent movement should be 
related. The results available indi- 
cate that they are not. 


Other Ceniral- Motor Factors 


Studies of extraneous tension dur- 
ing the occurrence of the response ap- 
pear to be of only incidental interest 
here. Action potentials taken from 
muscles not involved in the response 
are not related to the speed of re- 
action. according to Meyer (105, 
106). Daniel (30), however, con- 
sidered Meyer's conclusion not war- 
ranted by his data, and Henderson 
(69) reported a relationship between 
speed of reaction of one arm and ac- 
tion potentials taken from the noh- 
participating arm. 

Of greater interest in this context is 
the finding that lowered muscular 
tension produces longer RT’s (86, 87, 
147). This finding appears to be re- 
lated to the problem of the prepara- 
tory set and to the effects of practice. 
A few writers (e.g., 69) have reported 
increased speeds with practice. It is 
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possible that these increases may be 
due, not to the effect of learning on 
the RT itself, but to the effect of 
learning on the preparatory interval. 
This latter effect may consist of the 
learning of an optimal anticipatory 
muscular tension. 

The RT exhibits considerable vari- 
ability among individuals (41, 60, 92, 
161) but is, nevertheless, reasonably 
consistent. Reported reliability co- 
efficients range between 0.83 and 0.92 
(41, 42, 59, 115, 130, 131). RT meas- 
ures also show considerable varia- 
bility among experimental studies. 
Both types of variability may be due, 
in part, to the points on the tremor 
cycle at which the stimulus is pre- 
sented (145). In spite of variability, 
relatively high — intercorrelations 
among different kinds of RT tests 
have been obtained (42), which sug- 
gests a common factor present in the 
various tests, and, according to Sea- 
shore, Buxton, and McCollom (130), 
such a. factor can be analyzed. 
Further evidence supporting the no- 
tion of an RT factor comes from 
studies showing no relationship be- 
tween RT and intelligence (42, 133), 
substitution (133), card sorting (133), 
discrimination reaction time (42, 
130), pursuit rotor performance (42, 
130), and little or no relationship to 
tapping tests (130). Slocombe and 
Brakeman (134) have also suggested 
that RT measures yield a group fac- 
tor and, in addition, that this factor 
may be used to discriminate, at least 
among motormen, between good and 
poor accident risks. 


SPECIAL FAcTORS 


Certain conditions, such as expo- 
sure to extreme climatic conditions, 
the effects of drugs, prolonged effort, 
starvation, deprivation of sleep for 
long periods, etc., are singled out here 
because of the unique interest which 
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they hold for many investigators, 
Some, if not all, of these could have 
been included in the previous section, 


RT as a Function of Prolonged Readi- 
ness 


Mackworth (102) in a series of 
studies of military personnel under 
conditions of prolonged vigilance 
found that the RT to the double- 
jumps of a clock hand increased 
sharply after one-half hour of watch- 
ing. This increase could be prevented 
by the use of Benzedrine or by in- 
forming S of the results of the test. 

In a related study Kennedy and 
Travis (87) measured the action po- 
tentials and the RT to combined 
light and sound stimuli under long 
monotonous conditions. Their re- 
sults indicate that the frequency of 
action potentials decreases and the 
RT to irregularly presented stimula- 
tion becomes longer as the testing 
period is lengthened. After falling 
asleep, one stimulus presentation was 
sufficient to awaken S, the RT on this 
occasion being considerably longer, 
as might be expected. However, the 
RT approached its normal level 
rapidly with subsequent presenta- 
tions. 


RT as a Function of Certain Common 
Drugs 


The RT has been used as the “I 
told you so" of all those who would 
restrict the use of the common drugs 
In spite of this, the effects of these 
drugs on RT are not always clear 
when the conditions of the exper 
ment are carefully considered. Some 
of the experimental problems are dis- 
cussed by Hull (77) and more T€ 
cently by Gray and Trowbridge (59) 
and by Miller (109). No real attempt 
will be made here to evaluate this 
type of effect but merely to point out 
a few representative papers. 


Vith respect to coffee drinking, 
(68) and Gilliland and Nelson 
claimed lengthened RT's. On 
the other hand, there are studies such 
i that of Thornton et al. (143) in 

lich no effect was claimed. Ciga- 
smoking appears to decrease the 
bility of visual RT's according 
both Hull (77) and Fay (43), but 
no other reliable effect. Alco- 
lon the other hand, both increases 
jability and lengthens visual (108, 
(50) and auditory (150) RT's. Benzed- 
appears to have little or no ef- 
on the auditory RT (143). Mack- 
(102), however, did find that 
edrine tended to offset decre- 
ts produced by prolonged vigi- 
nce. Aspirin has no effect on either 
mal or auditory RT's (31). A 30 
cent saturation of carbon mon- 
is required to produce an in- 
e in visual RT (49). Finally, 
hine has the effect of first 
ortening and then lengthening RT 
cept when taken in large doses, in 
ch case only the latter effect oc- 
s (101). 


as a Function of Temperature 


A number of studies (e.g., 75, 110, 
7) have been done to determine the 
ects of climatic stress, temperature 
Particular, on the RT. The general 
t of these studies is that ambient 
mperatures between a range of 
EST. and 117°F, have little or no 
ect on either RT or more complex 
3 tion times. This conclusion was 
ed by Forlano, Barmack, and 
ey (50) after a careful review of 
e effects of ambient and body tem- 
tures on RT. Most of the studies 
ble for evaluation, however, 
e distinguished by the degree with 

several main variables are con- 
ded in one experiment, and con- 
ently are given to difficulty of 
‘pretation. Such a conclusion, 
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therefore, should not be accepted as 
firmly established. 

RT's have also had a little atten- 
tion with regard to skin tempera- 
tures. Craik and Macpherson (29) 
report that cooling of the hand with 
which the response is made may in- 
crease RT by 10-15 per cent. This 
conclusion is not reasonable on the 
basis of their study since an increase 
of this much turns out to be an in- 
crease of 0.02 to 0.06 sec., a change 
which has little significance in terms 
of the likely error of measurement 
and the size of the sample (two 5s). 

A few RT studies have been done 
with body temperature and/or time 
of day as the independent variable. 
In general, these studies (89, 90, 104) 
suggest that RT exhibits a slight 
diurnal variation, but with large in- 
dividual differences. The data may 
also be interpreted to indicate that 
RT is a function of body temperature 
and is only spuriously correlated with 
time of day (50). 


Effects of Sleep Conditions on RT 


Most of the studies in which the 
stimulus for the RT has been pre- 
sented during sleep have been related 
to the problem of determining the on- 
set or the depth of sleep. Since the 
detection of both of these conditions 
still lacks other independent criteria, 
studies of this sort have little validity 
with respect to generalizations about 
the effect of sleeping on the RT. A 
discussion of this problem may be 
found in Kleitman (88). 

Mullin and Kleitman (113) used a 
criterion of sleep onset which was in- 
dependent of the RT and then in- 
vestigated the change in a verbal, 
auditory RT for periods of time fol- 
lowing onset. In this experiment, on- 
set of sleep was considered to be 
established when S released a piece 
of paper which he held in his hand. 
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Using both normal and feebleminded 
adults and normal children as Ss, 
Mullin and Kleitman found a slow 
increase in the verbal RT for the first 
25 min. after onset, followed by a 
stable period of long RT's, and then 
by a third period of rapid shortening 
of the RT. This cycle, which was 
sigmoid in nature, was completed 
during the first hour after onset and 
did not reappear during the rest of 
the night. 

Considerably more work has been 
done concerning the effect of lack of 
sleep on RT. Although most of the 
studies were concerned primarily 
with more complex reaction times, 
enough have been done with the RT 
to allow some evaluation. 

As long ago as 1896 Patrick and 
Gilbert (116) tested the auditory and 
visual RT’s of three Ss deprived of 
sleep for 90 hours and reported a 
marked slowing of speed. Later, 
Robinson and Herrman (126), in a 
more rigorous experiment, found that 
experimental insomnia produced no 
consistent effect on RT. With the 
exception of one or two studies re- 
viewed by Kleitman (88), this latter 
result has marked the literature ever 
since, not only with respect to RT 
measures but practically all measures 
taken. Part of the difficulty might be 
blamed on the small samples usually 
used. On the other hand, large sam- 
ples are administratively extremely 
difficult to achieve in this type of in- 
vestigation. Lee and Kleitman (93), 
for example, tested the RT of only 
one S following 114 hours of sleep 
deprivation. This one S showed no 
effect. Cooperman, Mullin, and Kleit- 
man (28) report no change in audi- 
tory RT for six Ss after 60 hours of 
privation, as did Tyler (149) after 
24-114 hours of sleep loss. 

Edwards (36) reported a study in- 
volving relatively large samples. He 
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compared the auditory RT of 17 Ss 
deprived of sleep for 100 hours with 
that of ten control Ss. His data sup- 
port the studies already cited since 
he found no significant differences be- 
tween the groups. It seems, therefore, 
that although extremely small sam- 
ples may contribute to the variability 
of results, they cannot be considered 
the major reason for the failure of 
more recent studies to confirm the 
results of Patrick and Gilbert. 

Most writers, when discussing the 
general failure to show that sleep loss 
influences RT, invoke ‘‘compensa- 
tion" on the part of S as an explana- 
tory concept. What is usually meant 
is that S achieves the same perform- 
ance under stress as he does when not 
under stress by expending greater ef- 
fort. Granting the looseness of this 
argument, the possibility of testing it 
appears available. If one is willing to 
accept greater muscular tension dur- 
ing the foreperiod and during the RT 
interval as a measure of greater effort, 
it should be relatively easy to deter- 
mine by measuring tension during 
these intervals if compensation 18 
really a useful concept. 


Other Special Effects 


One experiment was found in which 
the effect on the RT of radial acceler- 
ation was studied. Canfield, Comrey: 
and Wilson (16) investigated the ef 
fects of positive acceleration forces 0 
1, 3, and 5 g on the RT’s to light and 
also to sound. Both kinds of RT were 
found to lengthen significantly wit 
increased acceleration. Moreover 
the slowing of reaction observed was 
not due to decrements in sensory 
functioning as the Ss were neithi 
“blacked out” nor “greyed out” and, 
in addition, the Ss did not notice any 
change in sensory functioning. 

In a different type of study, Tuttle 


et al. (148) reported that omissio® 
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of breakfast increased the visual RT 
of their ive Ss. In a more careful nu- 
trition study involving restriction of 
vitamin B, BroZek et al. (12) found 
that partial restriction of vitamin B 
intake for 23 days had no effect on 
the auditory RT of their eight Ss. 
Only prolonged and severe depriva- 
tion produced significant decrements. 

McFarland (97, 98, 99, 100) has 
shown that RT is affected by altitude 
(anoxia), but not significantly before 
atleast 20,000 ft. Variability of RT, 
however, begins to increase at 2000— 


3000 ft. before the average RT shows ' 


a decrement. McFarland (97) has 
also reported that people who live on 
mountains at extreme altitudes have 
longer and more variable RT’s than 
people living at sea level in the same 
region. 


SuMMARY AND CONCLUSIONS 


Factors other than those discussed 
would appear to be important, but 
have not been studied in relation to 
the RT. For example, it might be ex- 
pected that RT would decrease as a 
function of speed of travel or stimula- 
tion across the retina or the skin. The 
condition of the receptor and of the 
responding member are still among 
the conditions which have received 
sufficient attention. In spite of a 
Ey of research, the effect of 

ensory s i i 
Dian ystem is still an open 
poy of our notions about the RT 
a upon what are now classical 
thea " studies which did not have 
fece eee of modern statistical 
A um The question of mono- 
this d isensory stimulation falls in 
Fu ass. Research is also required 

— the effect of stimulus 
ia d and the interaction be- 
ibd. uration and intensity. Re- 
dm 0 this is the problem of area or 

€r of receptors stimulated, and 
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this too is in need of further experi- 
mental investigation. 

On the positive side, intensity 
functions are relatively well estab- 
lished. The last 20 years have yielded 
a considerable amount of information 
about the effects of age and sex. They 
have also produced more definitive 
information about the role of the re- 
sponding member, and about the 
factors involved in the foreperiod, al- 
though a great deal more research 
must yet be performed before the 
latter is really well understood. 

The incentive to study the effects 
of sleep loss, drugs, temperature, alti- 
tude, acceleration, vigilance, etc. to a 
large extent has come from applied 
areas. Few, if any, safe generaliza- 
tions are yet available. However, 
these topics will undoubtedly con- 
tinue to receive a great deal of atten- 
tion since many of them constitute 
important military problems. 

AII things considered, the last two 
decades have been quite productive, 
not of mathematically expressed em- 
pirical laws, but of useful nonmathe- 
matical generalizations and of clearer 
definition of the problems than was 
available before. The present status 
of the RT study may best be evalu- 
ated in terms of the generalizations 
which it has yielded. In the opinion 
of the writer the following generaliza- 
tions appear to have been reason- 
ably well established. 

1. There is a positive correlation 
between the visual and the auditory 
RT. 

2. Simultaneous stimulation of 
more than one sense modality pro- 
duces faster RT’s than stimulation of 
just one. On the other hand, succes- 
sive stimulation of different senses 
produces slower RT's than stimula- 
tion of a single sensory channel. 

3. For visual and thermal RT's the 
greater the extent of the stimulus in 
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space, i.e. the greater the number of 
receptors stimulated, the faster the 
speed of reaction up to some limit. 

4. Under daylight or illuminated 
conditions the visual RT becomes 
longer the greater the distance of 
stimulation from the fovea. 

5. In the case of each receptor sys- 
tem, RT is a negatively accelerated 
decreasing function of intensity up to 
some maximum intensity value after 
which RT either becomes suddenly 
lengthened, the function at this point 
being discontinuous, or asymptotic to 
a physiological limit. 

6. RT is a slowly falling growth 
function of chronological age until 
about 30 years after which it is a 
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slowly rising function. 

7. In general the RT of the human 
male is faster than that of the female. 

8. The optimum foreperiod of RT 
may be thought of as lying in a range 
between approximately 1.5 and 8.0 
sec. Its position in this range is de- 
termined by a large number of factors 
including the duration and intensity 
of the warning signal and of the stim- 
ulus, and the amount, locus, and time 
of production of muscular tension. 

9. RT is not related to the length, 
direction, or speed of movement of 
the responding member. 

10. Under vigilance conditions, the 
longer the period during which S 
must respond, the longer the RT. 
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University of Colorado 


The purpose of this article is to il- 
lustrate how the application of tradi- 
tional, systematic, rather than repre- 
sentative, experimental design to 
problems in clinical psychology re- 
sults in unjustified conclusions.! Five 
examples, all drawn from attempts to 
discover the effect of the examiner on 
the subjects' responses, are presented. 
In each case it will be shown that 
justified conclusions regarding the 
problems being investigated could 
have been drawn if representative 
rather than systematic design had 
been used. The presentation is in- 
tended to be illustrative rather than 
exhaustive. 

Consider first the simplest form of 
systematic design—the classical one- 
variable design. Both Brunswik (3, 
p. 8) and Fisher (11, p. 88) point out 
that this design is inherited from 
classical physics and both agree that 
revision of this procedure is neces- 
sary. Fisher, for example, remarks: 

In expositions of the scientific use of ex- 
perimentation it is frequent to find an exces- 


1 The two types of design are juxtaposed 
here in conformity with a distinction estab- 
lished by Brunswik (3). In the sense in which 
he uses the terms, "representative design" 
refers to the transfer of the principles"of 
sampling statistics from the subjects of a 
psychological investigation to the objects or 
situations which constitute the stimuli in the 
investigation. The arbitrary orderliness with 
which these external (independent) variables 
are customarily handled is summarized by 
Brunswik under the opposite heading of sys- 
tematic design,’ and’ Fisher's factorial design 
(11) is presented as a relatively recent and 
relatively complex example. Fisher has also 
used the term "systematic" in a similar, 
albeit somewhat casual, manner (12). 
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sive stress laid on the importance of varying 
the essential conditions only one ata time. ... 
This ideal doctrine seems to be more nearly 
related to expositions of elementary physical 
theory than to laboratory practice in any 
branch of research. In experiments merely 
designed to illustrate or demonstrate simple 
laws, connecting cause and effect, the relation- 
ships of which with the laws relating to other 
causes are already known, it provides a means 
by which the student may apprehend the re- 
lationship, with which he is to familiarize 
himself, in as simple a manner as possible. By 
contrast, in the state of knowledge or ig- 
norance in which genuine research, intended 
to advance knowledge, has to be carried on, 
this gone formula is not very helpful (11, 
p. 88). 


Brunswik and Fisher agree con- 
cerning the disadvantages of classical 
experimental design, but they differ 
as to the remedy. "Their differences 
lie not so much in the general nature 
of the reform needed as in the extent 
of the reform. Thus, although 
Fisher's dissatisfaction led him to de- 
velop the multivariate analysis of 
variance and related techniques, 
Brunswik's efforts have resulted in a 
more thorough and more radical re- 
vision of experimental methodology. 
For, although Fisher urged multi- 
variation of conditions in the experi- 
ment (wherein results are obtained), 
thereby approaching one of the con- 
ditions permitting the application o 
results, Brunswik urges that in order 
to eliminate the traditional artificial- 
ity in the choice and manner of varia- 
tion of the experimental variables, 
the conditions of the experiment 
represent statistically the universe © 
situations toward which one wishes to 
generalize. 


REPRESENTATIVE VERSUS SYSTEMATIC DESIGN 


In suggesting that the logic of 
sampling theory (which psychologists 
have long applied to populations of 
subjects) be applied to stimulus situa- 
tions, Brunswik brings experimental 
methodology in line with the modern 
statistical approach to the problem 
of inductive inference. For example, 
“it is clear that the statistical signifi- 
cance of a result may be investigated 
in both directions" (3, p. 36). Situa- 
tional (or ecological") generality of 
results, therefore, is as much a chal- 
lenge to statistical scrutiny: as the 
subject-populational generality of re- 
sults. As Brunswik points out, not 
only should we be concerned with the 
number of stimulus variables in- 
cluded in the experiment, but we 
must also consider the manner of 
variation of the stimulus variables and 
their covariation among one another , a 
problem which Fisher, in his concern 
with the mathematical problems in- 
volved in increasing the number of 
variables, did not consider as fully 
as psychologists might wish. 

The above is an extremely cursory 
description of the fundamental no- 
tion of representative design. The 
examples from clinical psychology 
presented later, however, will further 
clarity its meaning. For a complete 
exposition of representative design, 
the reader is referred to Brunswik's 
monograph (3). From the field of 
academic psychology—notably from 
the perceptual constancies, depth 
perception, gestalt problems and il- 
lusions, and social perception—the 
reader may find concrete examples of 
Tepresentatively designed experi- 
ments and ecological surveys in (2; 3, 
EM 41-52; 6; 9). The general 

asis for representative design as 
m by the probabilistic nature of 
ehavioral adjustment is presented 
v Brunswik in (1). Further discus- 
lon of representative design and of 
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its place in the historical development 
of psychology may be found in (5). 

An analogy between the discovery, 
on the part of relativity physics, of 
the confinement of traditional physi- 
cal laws to a limited universe of con- 
ditions, on the one hand, and the con- 
siderations in psychology that have 
led to the establishment of represen- 
tative design, on the other, has been 
pointed out by Hammond (16). In a 
note to this analogy, Brunswik (4) 
has endorsed the writer's interpreta- 
tion that in spite of the apparent 
stress on "theory," in the first case, 
and on "design," in the second, the 
major problem in both cases is the 
same, that is, "generalization." 

Another paper by Hammond (15) 
constitutes an application of the 
principles of representative design to 
certain generalization problems fre- 
quently mismanaged in current social 
and clinical psychology. This paper 
endeavored to criticize a concrete 
example of a research project, an in- 
terview study by Robinson and 
Rohde (21). In this study the sub- 
jects had been properly sampled but 
the sampling of the interviewers had 
been ignored; yet the conclusions 
concerning the "significance" of the 
results were extended to both the 
subject and the interviewer popula- 
tions. The point made by the'present 
writer in criticism of this report was 
that the interviewers should have 
been treated as "objects," that is, 
as parts of the “situations,” and thus 
under the rules of representative de- 
sign should have been sampled in the 
same manner as were the subjects 
proper; or else the generalizations 
should have been limited to the sub- 
ject population. 


2 In pointing out the wide discrepancies be- 
tween the large numbers of "judges" (re- 
sponders or subjects proper) and the small 
numbers of "social objects" (subjects func- 
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In the present paper we wish to 
extend our criticism of research re- 
ported in the literature to the inap- 
propriate application of systematic 
design to the type of experiment 
which seeks to determine the effect of 
the examiner on the subjects' re- 
sponses. 


OneE-Factor SYSTEMATIC DESIGN 


In the examples discussed below, 
the attempt is made to vary one fac- 
tor, such as the race or sex of the 
examiner, in order to find the effect 
of the stimulus variable in question 
on the subjects' responses. 

Effect of race. Reiss, Schwartz, and 
Cottingham (20) were concerned 
mainly with verifying the utility of 
the (Thompson) Negro version of the 
TAT, but were also concerned with 
discovering the effect of the race of 
the examiner on the responses of the 
subjects to the Thompson TAT. 
"Fifteen Negro and 15 white students 
were tested by a Negro administrator 
and 15 Negro and 15 white students 
were tested by a white administra- 
tor" (20, p. 704). Thus, the inde- 
pendent variable of race is consti- 
tuted by one representative from each 
race. In light of that fact, consider 
the following conclusion drawn by 
the authors: "Northern white Ss 
produce longer stories than do North- 
ern Negroes when the stimulus ma- 
terial offers Negro figures regardless 
of the color of ihe examiner" (italics 
ours) (20, p. 708). 

Note that this conclusion follows 
from the comparison of the stories of 
white and Negro subjects to one white 


tioning as stimuli) typically found in the 
literature on social perception from photo- 
graphs, Brunswik (3, p. 38 and Table 2) has 
suggested the use of the lower case letter n 
for the size of the responder sample and of the 
capital letter N for the size of the ecological 
or object sample. This system has been used 
in the present paper also. 
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and one Negro examiner. It is appar- 
ent that while there is a potentially 
adequate sample of n=60 subjects, 
the size of the object (or ecological) 
sample is thus a bare N —2 (for the 
use of n vs. N see footnote 2). If, in 
fact, "it is clear that the statistical 
significance of a result may be in- 
vestigated in both directions" (3, p. 
36), then we may legitimately ask 
how it is possible to generalize from 
results obtained with one representa- 
tive of each race as a stimulus, any 
more than if the experiment included 
only one member from each race asa 
subject? Generalization from sample 
to population is necessary on both 
sides of this experiment and the sta- 
tistical rules which permit generaliza- 
tion hold under both conditions. For 
Reiss et al. to compare the results ob- 
tained under the conditions described 
above with Thompson's results is just 
as meaningless as if the two expert- 
ments were each carried out with a 
single subject. Clearly, an invalid 
generalization is drawn in the above 
experiment. Only a representative 
type of experimental design in which 
adequate consideration of the situa- 
tion to which the experimenter 
wishes to generalize is given woul 

permit valid generalization concern- 
ing the effect of the stimuli. 

Effect of sex. Curtis and Wolf re- 
port an experiment in which the prob- 
lem is stated as follows: "To study 
the influence of the sex of the exam- 
iner on the production of sex ré 
sponses on the Rorschach" (8, P. 
345). The subject-population sample 
consisted of 586 Rorschach records. 
The independent variable, sex of the 
examiner, was constituted by three 
female and seven male examiners: 
The conclusions are: “There is a 818- 
nificant difference between our malt 
and female examiners on the number 
of records with sex responses.” 
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This experiment suffers from the 
same lack of attention to the estab- 
lishment of an independent variable 
as the experiment discussed earlier. 
Note the elaborate concern of the 
authors to establish subject-popula- 
tional generality (n —586). Contrast 
this sampling procedure with that on 
the stimulus side (male N —7, female 
N23). Yet it is the latter sample 
which constitutes the independent 
variable and which therefore leads to 
the conclusion that the sex of the 
examiner influences the response of 
the subject. This type of imbalance 
(favoring n over N) is characteristic 
of the indiscriminate application of 
systematic design. Again, there is a 
clearly invalid generalization on the 
stimulus side of the experiment which 
only representative design can 
remedy. 

An approximation to representative 
design. An example of an “effect of 
the examiner" experiment which ap- 
proximates representative design 
may be found in Gibby (14). Twelve 
examiners each tested 20 patients 
under one set of conditions, and in 
the second set of conditions “one 
hundred thirty-five subjects were 
used, each randomly assigned to one 
of nine examiners” (14, p. 450). Al- 
though this experiment also shows 
marked imbalance (subject m’s equal- 
ing 240 and 135, and object N’s 
equaling 12 and 9), a better approxi- 
mation is obtained here than in any 
other experiment of this sort seen by 
the writer. Gibby's experiment is 
Cited mainly to illustrate that repre- 
Sentative design does not present in- 
Superable difficulties. 
he further point. If the experi- 
thik er wishes to limit his conclusions 

‘ne particular conditions of the ex- 
ut (as is frequently the case in 
ue led problems), we have no criti- 
ism. If, for example, Reiss et al. 


wish to point out that this examiner 
produces (or does not produce) differ- 
ent responses than that examiner, we 
have no criticism. Generalizations 
such as those made in the above ex- 
periments, however, demand ran- 
domization—and randomization is 
what is lacking. í 

In summary, unwarranted conclu- 
sions concerning the effect of the 
examiner occur as a result of failure 
to establish the crucial independent 
variable. Itisimportant to note that 
failure to establish the independent 
variable occurs as a result of applying 
the logic of statistical inference to one 
side of the experiment only. 


MULTIFACTOR SYSTEMATIC DESIGN 


This section is concerned with 
multifactor systematic design (fac- 
torial design) methods and repre- 
sentative design. We begin with an 
example from the same field of re- 
search as in the previous section. 

Sex and method of administration. 
Garfield, Blek, and Melker (13) were 
concerned with ascertaining the effect 
of the sex of the examiner and two 
methods of administration (complete 
session and interrupted session) upon 
TAT stories. Two male and two fe- 
male examiners constituted the sex 
variable and one of each was assigned 
to the different methods. The experi- 
menters also wished to discover 
whether differential effects would be 
obtained with male (7=54) and fe- 
male (n=56) subjects. Although the 
experimenters do not treat their re- 
sults completely in terms of a fac- 
torial design, most readers will recog- 
nize that the conditions of the experi- 
ment allow use of a 2X22 design as 
in Table 1. For purposes of illustra- 
tion we will assume that Garfield et al. 
did set their experiment up in this 
fashion. This assumption will in no 
way invalidate our analysis. 
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TABLE 1 
EXAMPLE OF FACTORIAL DESIGN 


Split Session 


Male E Female E 


Male Ss | Fe Ss Male Ss Fe Ss 


A useful, although uncustomary, 
step in analyzing the independent 
variables of experiments designed in 
this fashion is to separate the physi- 
cal condition variables from the ‘‘per- 
son condition" variables. If, for 
example, the independent variables 
include 2 methods, 2 types of exam- 
iners, and 2 types of subjects, as 
does the Garfield experiment, it is 
important to note that we have a set 
of physical conditions (methods) and 
two sets of “person” conditions. In 
the experiment under discussion, the 
"person" conditions can be further 
broken down into subjects and ‘‘ob- 
jects," i.e., examiners. 

Person conditions (subjects). The 
customary principles of subject-popu- 
lation sampling apply here and need 
no discussion. 

Person condition (objects). Again it 
is easy to see that the same restric- 
tions concerning generalization apply 
here as in subject sampling. Failure 
to observe these restrictions results in 
Garfield's unjustified generalization 
that sex of the examiner does not in- 
fluence the subjects' response. Fac- 
torial design per se does not remove 
these restrictions. As Edwards says 
in discussing a similar hypothetical 
factorial design experiment wherein 
instructors and methods constitute 
the independent variables, “let us 
suppose that we have selected the 
instructors to represent particular 
types or personalities or abilities. 
The three used in the experiment are 
definitely not a random sample from 


less concern. Therefore sampling 0 
objects (examiners) will require 8 


sidered in terms of representative 


on factorial methods, for it c 
duced to the question of wh 


or interaction effects. 


Complete Session 


Male E Female E 


Male Ss | Fe Ss Male Ss | Fe Ss 


any defined population" (10, p. 249). 


In other words, generalization to 


other instructors is prohibited; con- 
clusions are confined to the particular 
instructors involved in the experi- 
ment. 


Here we would like to point out 
that in "effect of the examiner" ex- 
periments it ordinarily will be more 
important to achieve  preciseness 


with regard to the independent varia- 
ble than the dependent variable. 
That is, we will be primarily con- 
cerned with the problem of whether 
or not certain examiner variables | 


make a difference—to whom (that is, | 
to which subject-population) e 
make a difference will ordinarily beo 


larger sample, more carefully con- 


ness, than will sampling of pus 
Physical conditions. Here the issu 


of sampling and generalization be- 
comes somewhat obscure. 3 
easy to conceive of methods, or tests, 


It is not 


or other physical conditions being | 
selected by means of random "s 
pling procedures. Yet this issue ; 
random selection of conditions has 


very i portan i ear ing 
y impo tant pr i 


at one 


* he 

considers to be the error o y 
i ithin-groups variance , 
design (within-group: t wil 


interaction variance) agains 


" 1 ec 
to test the significance of main € 
: We turn now 


to this question. | 
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WirRIN-GROUPS VARIANCE AS ERROR 


Under the circumstance where the 
physical conditions of the experiment 
are selected by nonsampling methods, 
generalization must be confined to 
the population of subjects sampled 
with respect to the particular physical 
conditions present in the experiment. 
Within-groups variance is legitimate 
as error, since here is where random 
sampling took place. Therefore, gen- 
eralization takes place with regard to 
subjects only. 

But—what about generalizations 
concerning the conditions of the ex- 
periment? Here the crucial issue be- 
tween systematic factorial design and 
representative design lies in the man- 
ner of covariation of the physical con- 
dition variables among one another. 
Unless their arrangement (not merely 
their number) in the experiment is 
considered in light of the conditions 
toward which the generalization is 
aimed, our conclusions are restricted 
from that generalization. 

Fisher was very conscious of the 
necessity for achieving generalization 
concerning the range of conditions. 
For example: “The exact standardi- 
vation of experimental conditions, 
Which is often thoughtlessly advo- 
cated as a panacea, always carries 
with it the real disadvantage that a 

wehly standardized experiment sup- 
pee direct information only in re- 
ee of the narrow range of condi- 
s periem RE standardization. 
ie a wae: igen weakens 
ites, n gthens our ground 
EM g a like result, when, as is 
cadi y the case in practice, these 
ha ions are somewhat varied” 
ni. eU Fisher also considered 
vonditions c. £t the arrangement of 
a E ki e gr ne important 
ys ate designs and wrote 
pers (12, Ch. 17, 28) contrast- 
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ing the systematic arrangement of 
field conditions to random arrange- 
ment—to the disadvantage of system- 
atic arrangement. Brunswik draws 
the issue more sharply, pointing out 
that "generalizability of results con- 
cerning...the variables involved 
must remain limited unless at least 
the range, but better also the distri- 
bution . . . of each variable, has been 
made representative of a carefully 
defined universe of conditions" (3, 
p. 53).  Brunswik goes beyond 
Fisher in asserting that systematic 
factorial designs frequently “tie,” or 
link together, physical condition vari- 
ables according to the convenience 
(sometimes arithmetic) of the experi- 
menter's circumstance (3, p. 6). Like- 
wise, variables may be “untied”; 
that is, the links or correlations 
among variables in the situation to 
which the results are to be applied 
are often disrupted, usually through 
the “hold all other variables con- 
stant," or "isolate a variable" pro- 
cedure. Note the manner in which 
certain stimulus variables are arbi- 
trarily "tied" and "untied" in the 
following example. 

Personality and sex. Holtzman (17) 
was interested in ascertaining the 
effect of the examiner as a variable in 
the Draw-A-Person Test. His pro- 
cedure in establishing the independ- 
ent variables of sex and of personality 
was as follows: 

Four experienced examiners, two male and 
two female, were chosen from a group of ad- 
vanced graduate students in clinical psychol- 
ogy. The two pairs of examiners were selected 
so as to maximize differences in examiner ap- 
pearance and personality within both sexes. 
Examiner M1 was nearly a foot taller and 
sixty pounds heavier than the other male 
examiner, M2. The two female examiners, 
F1 and F2, were approximately the same size 
but differed considerably in feminine qual- 
ities" (17, p. 145). 


In line with our general emphasis 
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on the need for scrutinizing the man- 
ner in which the independent variable 
is constituted in these experiments, 
note that height and weight were 
"separated" or varied “within sex" 
for the male examiners (M1 was a 
foot taller and sixty pounds heavier 
than M2). On the other hand, the 
height of the female examiners was 
the same (their weights are not 
mentioned), but their "feminine qual- 
ities” "differed considerably." Ap- 
parently several different stimulus 
variables are "tied" (height with fe- 
male sex) and "untied" (height and 
weight from male sex and “feminine 
qualities" within female sex). This 
situation obscures the independent 
variable. This obscurity is bound to 
result when randomization is not 
effected in the sampling procedures, 
whether we are dealing with sampling 
of subjects or **objects" (in this case, 
examiners) .* 

Thus, when conditions are not 
samples, generalization is limited to 
the subject-population and therefore 
within-group variance is the only 
legitimate error term. 


INTERACTION TERMS AS ERROR 
Now consider the circumstance 


3 Note the hypothesis to be tested under 
these circumstances "(/) The sex of the ex- 
aminer has a measurable effect... (2) The 
personal characteristics of the examiner aside 
from sex have a measurable effect...” 
(17, p. 145). The "male variable" is estab- 
lished by drawing two males from the popula- 
tion, one each from some hypothetical per- 
sonality-type population. The same for the 
“female variable.” From this sample of ex- 
aminers Holtzman attempts to refute the 
findings of Sinnett and Eglash (22) (who also 
used /wo examiners) concerning the relation- 
ship between examiner personality and re- 
sponse to the Draw-A-Person Test. Obviously 
these particular examiners differed in many 
variables other than sex characteristics. Such 
variables can be eliminated as “causes” of 
results only by randomization—exactly as 
they are eliminated in sampling subject popu- 
lations. 
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where the interaction term might be 
considered as error. This is the point 
at which textbooks begin to meet the 
problem of random sampling of phys- 
ical conditions. In his discussion of 
the interaction term as error in his 
Design of Experiments (11, pp. 201- 
205), Fisher deals with the problem of 
random sampling of conditions al- 
most to the exclusion of other topics. 
Both Edwards (10) and Lindquist 
(18) consider this problem in connec- 
tion with psychological and educa- 
tional experiments in which methods, 
schools, etc. are conditions of the 
experiment. 

Edwards (10, p. 252) in discussing 
interactions as error terms states, 
“ ..it would be illogical to argue 
that the ... particular methods .+. 
selected for investigation have been 
randomly selected from a population 
of methods." Lindquist (18, p. 169) 
discusses this problem through a hy- 
pothetical experiment where style 
and size of type are the independent 
variables. '*... the particular sf 
(or sizes) involved may not strictly 
be considered as a random sample 
from a ‘population’ of styles (o 
sizes). The interaction variance Im a 
factorial design is therefore usually 
not strictly a measure of normally 
distributed random fluctuations 
which theoretically must be true 9 
the error term in any F-test or t-test. 

When discussing the necessity ff 
randomness in the use of interaction 
terms as error terms, Edwards makes 
very explicit the fact that it is “illogi- 
cal to argue” that particular me 
have been “randomly selected” an 
that particular instructors cannot 
considered random samples. Yet e 
chooses as an example an experimen 
by Child (7) which violates this de 
quirement (10, p. 261). Edwards 0€ 
Scribes the conditions as follows: 
“The variables introduced he 
follows: the sex of the children " 
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as subjects in the experiment; the sex 
of the experimenter present during 
the test situation; the nature of the 
barrier introduced between the sub- 
ject and the distant goal object; and 
the type of instructions given to the 
child." Child recognizes that his 
conclusion that "choice of a distant 
goal was more frequent in the pres- 
ence of a woman experimenter than 
in the presence of a man experimen- 
ter" is "severely limited by the fact 
that the sample of experimenters 
was limited to one of each sex" (7, 
p. 30). However, Child does use the 
pooled interaction term as error. 
"The 11 degrees of freedom pertain- 
ing to the 11 possible interactions 
were therefore pooled for an estimate 
of error" (7, p. 19). If the writer has 
correctly identified Child's interac- 
tion terms, four (EXbarrierXin- 
structions, EXbarrier, EXinstruc- 
tions, barrier Xinstructions) of the 11 
interactions do not include a variable 
selected at random. As Edwards put 
it, “furthermore, and this is most 
important, it is necessary that the 
categories of one or more of the varia- 
ables in the experimental design be a 
random selection from the population 
being sampled” (10, p. 252). This is 
followed by a footnote: “This condi- 
tion will not be met by argument 
after the experiment has been carried 
through to completion.” 

In summary, then, random sam- 
pling of the conditions, physical or 
otherwise, of the experiment is pre- 
Tequisite to the use of interaction 
terms as error variance. Ignoring 
this prerequisite violates the princi- 
ple of generalization. through ran- 
domization in exactly the same fash- 


lon as in the case of a nonsampling 
One-factor design. 


AGRICULTURAL DESIGNS IN 
PsyCHOLOGY 


The above remarks concerning 
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interaction terms lead directly to 
another issue—the use of agricultural 
designs in psychology. For example, 
in Lindquist's discussion of interac- 
tion terms as error, after very clearly 
pointing out the logic involved in 
compromising with the statistical as- 
sumptions involved, he concludes the 
discussion by emphasizing that 
“|. the procedure just recom- 
mended is arbitrary in character, 
although wide experience in agri- 
cultural research indicates that it is 
usually satisfactory" (italics ours) (18, 
p. 170). Snedecor, however, while 
discussing interaction terms as error 
terms in connection with an agricul- 
tural experiment, says, ‘The require- 
ment of randomness is apparently 
never met in this kind of work, so 
that statements about probability 
must be considered inexact” (23, p. 
303). 

The above remarks concerning 
agricultural research have been em- 
phasized because they are the key to 
a fundamental issue. The issue is 
whether the application of agricul- 
tural experimental procedures to psy- 
chological experiment needs further 
scrutiny. We believe that such 
scrutiny is in order on grounds that 
there are important discrepancies be- . 
tween the aims and conditions of 
most agricultural research and most 
psychological research.^ The follow- 
ing points bear on this question. 

The type of agricultural research 
for which Fisher developed factorial 
design was primarily engineering- 
type research. He was concerned 
4... that, in any case, there will be 
no reason for rejecting the experi- 


4 Lindquist also notes (18, p. 169) that 
there will be some circumstances where this 
procedure will not be reasonable. 

5 See McNemar (19) for a discussion of an 
important difference in the conditions of agri- 
cultural and psychological research in connec- 
tion with latin-square design. 
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mental results on the ground that the 
test was made in conditions differing 
in one or other of these respects from 
those in which it is proposed to apply 
the results" (11, p. 98). In engineer- 
ing-type experiments the selection 
and arrangement of the conditions 
are usually dictated by the particular 
circumstances to which the experi- 
menter wishes to apply his results— 
and to which he confines his results 
and conclusions, as Fisher made 
clear. A further characteristic of 
engineering-type experiments is that 
future conditions are under the con- 
trol of the experimenter, so that he 
can maintain the conditions of the 
experiment, e.g., issuing the same 
ratios of ingredients in the ration, the 
same ratios of chemicals in the fer- 
tilizer, etc., as used in the experi- 
ment. "Therefore, Fisher's factorial 
design methods are eminently appro- 
priate to the applied problems with 
which he was concerned. Where the 
problem of generalizing beyond a 
particular plot—the soil heterogene- 
ity problem—was concerned, how- 
ever, he was definite and explicit 
about the advantages of randomiza- 
tion. (See 11, 12.) 

Psychologists, however, have other 
problems at stake than those of appli- 
cation; for example, it is in the nature 
of the theoretical task to seek gen- 
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In an excellent paper, Moses (13) 
presents some of the principal non- 
parametric methods and an intuitive 
explanation of their rationale, prop- 
erties, and applicability, with a view 
to facilitating their use by workers in 
psychological research. One of the 
important topics in the field of non- 
parametric methods is the Kolmo- 
gorov-Smirnov statistic. Recent re- 
sults and tables on this topic have 
been prepared which contribute to- 
ward establishing the Kolmogorov- 
Smirnov statistic as a standard non- 
parametric tool of statistical analysis. 

We shall present an intuitive ex- 
planation of the rationale and uses of 
the Kolmogorov-Smirnov statistic. 
A table will be presented which facili- 
tates the use of the Kolmogorov- 
Smirnov statistic by research work- 
ers. Some illustrative examples will 
also be given. 


1. Two-SIDED TESTS 
1.1. One-Sample Test 


In this section tests of "goodness of 
fit" will be considered. That is, we 
shall be concerned with' the agree- 
ment between the distribution of a 
set of sample values and a theoretical 
distribution. Probably the most wide- 
ly used nonparametric test of ‘‘good- 
ness of fit" is the chi-square test (4). 
However, some evidence has been 
presented indicating that the test 
which we shall now describe, the 


1This paper was prepared in connection 
with research supported by the Office of Naval 
Research at the Statistical Research Center, 
University of Chicago. The author wishes to 
thank Mr. Herbert David, University of 
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Kolmogorov-Smirnov test for goi 
ness of fit, may be a better all-arou 
test, when it is applicable, thant 
chi-square test (1, 12). A concise 
scription of other tests of fit appr 
in (2). 
Suppose that a population 
thought to have som: compl 
specified cumulative frequency 
tribution function, say F(x). Thi 
for any specified value of x, 
of F(x) is the proportion of in 
als in the population having m 
ments less than or equal to X. - 
observed cumulative step-fui 
Sy(x) of a sample of N obse 
(that is, Sy(x) =k/N, where k 
number of observations less the 
equal to x) is expected to be f 
close to this completely specified d 
tribution function F(x). If it 8% 
close enough, we have eviden! 
the hypothetical distribution 
not the correct one. As a meas 
how far the observed cumuk 
step-function Sy(x) is from 
pothetical distribution F(«) 
the maximum absolute diffe 
between Sy(x) and F(x); 
dead lO — Sy(x) |. 9 
d is large-we have evidence tha! 
is not the correct population 
lative frequency distributio 
tion. 
Let us first consider the case 
F(x) is a continuous cumulati 
tribution function. If the 
population cumulative distri 
in fact F(x), then the samp 
tribution of d is known and 
tabled (1). For example, if FQ 
correct population distributi 
find from p. 428 in (1) that the pP 
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ability that dz: 5/15 is Pr(dz 5/15] 
-»1— Pr(d« 5/15] —1— .945 2.055, 
and Pr{d26/15} =1—.989 =.011 for 
a sample of N — 15 observations. 

Hence, in order to test the null hy- 
pothesis that F(x) is the correct popu- 
lation distribution using a sample of 
N=15 observations, the maximum 
absolute deviation d between the 
sample cumulative — step-function 
Su(x) and F(x) is computed. The 
null hypothesis is rejected when 
dz5/15, if the null hypothesis is 
tested at the .055 level of significance. 
If the test is at the .011 level of sig- 
nificance, the null hypothesis is re- 
jected when 2z: 6/15. 

Let us now consider the case where 
F(x) may be a discontinuous cumula- 
tive distribution function. Several 
articles dealing with the Kolmogorov- 
Smirnov tests claim that the methods 
apply only where the chance variable 
is continuous. This is true only if 
exact probability statements are re- 
quired. (The reader will note that 
exact probability statements cannot 
be made even if the chi-square sta- 
tistic is used to test goodness of fit 
since little is known about the actual 
sampling distribution of the chi- 
square statistic for finite sample size 
N and given F(x). The chi-square 
statistic becomes approximately dis- 
tribution-free when the sample size 
N approaches infinity, but is not dis- 
tribution-free for finite N.) The in- 
equalities stated in (5, 7) serve to 
validate the use of the Kolmogorov- 
Smirnoy statistic in the case where 
F(x) maybe discontinuous. From 
these inequalities we see that when 
F (x) may be discontinuous, the error 
obtained will be in the "safe direc- 
im if tables are used which assume 
ps F(x) is continuous. More pre- 
hat, the level of significance of a 
ti ased on the sampling distribu- 
uw of d will be no larger than the 
evel of significance of that test when 
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F(x) is assumed continuous. For ex- 
ample, if F(x), which may be discon- 
tinuous, is the correct population dis- 
tribution function, we find from the 
tables on p. 428 of (1) that the prob- 
ability that d25/15 is at most .055, 
and Pr(dz 6/15] $.011 for a sample 
of N=15 observations. Hence, in 
order to test the null hypothesis that 
F(x), which may be discontinuous, is 
the correct population distribution, 
the value of d is computed. If the 
null hypothesis is rejected when 
d 2 5/15, then the level of significance 
is, at most, .055. The test will be at 
no more than the .011 level of signifi- 
cance if the null hypothesis is re- 
jected when dz 6/15. Hence, if the 
same test is used as when F(x) was 
assumed continuous, the level of sig- 
nificance will be no more than .055 
(or .011). 

The Kolmogorov-Smirnov statistic 
can also be used to estimate prob- 
abilities and obtain confidence bands 
for the true cumulative distribution 
function F(x) (1, 7, 10, 15, 16). These 
confidence bands will be free from 
any restriction concerning the nature 
of the function F(x). We need not 
assume that F(x) is continuous in 
order to obtain confidence bands for 
F(x). Let us illustrate this use with a 
particular set of data. A sample of 15 
observations was obtained. If this 
sample is arranged in order of in- 
creasing size, we obtain 


1, 2, 2, 2,2, 4, 4, 4, 4, 5, 5, Si S: 


The reader may be interested to 
know how these 15 observations were 
obtained. A student was asked to list 
three men whom he liked and three 
men whom he did not like from 
among all the men he had known 
since birth. The number 0 was as- 
signed to the person liked the best, 
number 1 to the person liked second 
best, number 2 to the person liked 
third best, and so on, and 5 was as- 
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signed to the person disliked the 
most. He was then asked to place the 
number 0 on a sheet of paper if he 
thought that his number 0 person 
(best-liked person) would be the 
richest (among the total of six men 
listed). He was to place the number 
1 on the paper if he thought that his 
number 1 person (second best liked) 
would be richest, and so on, and the 
number 5 if he thought that the per- 
son he disliked most would be the 
richest among the six. Hence, a single 
number from 0 to 5 was obtained 
from this student. Fifteen such ob- 
servations were obtained by using 
fifteen different students as the sub- 
jects. The sample cumulative step- 
function Sis(x) for the 15 observa- 
tions is given in Fig. 1. From the 
tables on p. 428 in (1) we see that the 
chance is at least .945 that the maxi- 
mum deviation between S(x) and 
the true cumulative distribution F(x) 


1 2 3 4 5 


Fic. 1. Two-sipED 94.5 Per Cent CON- 
FIDENCE BAND FOR F(x) OBTAINED FROM A 
SAMPLE OF 15 OBSERVATIONS 
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will be less than 5/15. Hence, if a 
band of width 5/15 is drawn above 
and below Sis(x), we can state with 
at least “94.5 per cent confidence" 
that the true cumulative distribution 
F(x) lies within that band. The 94.5 
per cent confidence band for F(x) is 
illustrated in Fig. 1 by the dotted 
lines above and below Si;(x). There- 
fore, any number of statements of 
the following kind may be made 
simultaneously with at least 94.5 per 
cent confidence: F(0)<1/3, F(3) 
<2/3, and F(4) is a number between 
26.667 per cent and 93.333 per cent. 

Let us consider the null hypothesis 
that a student is equally likely to 
choose any one of the six numbers. 
Then the population cumulative dis- 
tribution function F(x) would be as 
given in Table 1. The values of 
Sis(x) and | F(«)—Sis(x)| are also 
given in the table. Since the maxi- 
mum absolute difference between 
F(x) and S(x) is 10/30 —5/15, the 
null hypothesis is rejected at the .055 
level of significance. 

The reader will note that the sig- 
nificance test which was performed 
was for a completely specified popu- 
lation "cumulative distribution func- 
tion F(x). In cases where parameters 
must be estimated from the sample 
(for example, when the null hypothe- 
sis is that the population distribution 
is normal with unspecified mean and 
standard deviation, and the mean 
and standard deviation must first be 
estimated from the sample), there are 
no theoretical results at present 
which give exact critical levels for the 
Kolmogorov-Smirnov statistic. The 
distribution of d is not known when 
certain parameters of the population 
have been estimated from the sample. 
It may be expected, however, that 
the effect of adjusting the population 
mean and standard deviation to those 
of the sample will be to reduce the 
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TABLE 1 
ABSOLUTE DIFFERENCE BETWEEN A SPECIFIED F(x) AND Su(x) FOR A SAMPLE 
oF 15 OBSERVATIONS 
x 0 1 2 4 5 
F(x) 1/6 2/6 3/6 4/6 5/6 1 
Su(x) 0 1/15 5/15 5/15 9/15 1 
| F(x) —Sis(x) | 5/30 8/30 5/30 10/30 7/30 0 


critical level of d. If the critical 
value of d (from tables which assume 
a completely specified population 
distribution) is exceeded in these 
circumstances, we may safely con- 
clude that the discrepancy is signifi- 
cant (see p. 73 of [12]). In cases 
where parameters must be estimated 
from the sample, the chi-square test 
is easily modified by reducing the 
number of degrees of freedom. The 
Kolmogorov-Smirnov test has no 
such known modifications. 


1.2. Two-Sample Test 


_ In this section the problem of test- 
ing whether two random samples 
have been drawn from the same 
population is considered. That is, 
we shall be concerned with the agree- 
ment between the distributions of 
two sets of sample values. 

Let us denote the observed cumu- 
lative step-function of the first sam- 
ple of N observations by Sy(x), and 
let. S'u(x) be the observed cumu- 
lative step-function of the second 
sample of M observations. The two 
uie step-functions Sy(x) and 

u(x) are expected to be fairly 
RM to each other if both sam- 
p ^ are drawn from the same popula- 
li If they are not close enough, 
Ehud have evidence that the 
eli Mn from different popula- 
(tet Pes is, the population cumu- 
inda uou function for the 
Him "x rom the first sample is differ- 

rom the population cumulative 


distribution for the values from the 
second sample.) As a measure of how 
far apart are the two cumulative 
step-functions we use the maximum 
absolute difference d^ between them; 
that is, d'2maximum,  |Sx(x) 
—S'u(x)|. When d' is large we have 
evidence that the samples came from 
different populations. 

Let us first consider the case where 
the values from both samples are as- 
sumed to have continuous population 
cumulative distributions F(x) and 
G(x), respectively. We wish to test 
the null hypothesis that F(x) =G(«) 
and the null hypothesis will be re- 
jected if the observed value of d’ is 
significantly large. The limiting dis- 
tribution of d’ has been tabled in (15), 
and a method of obtaining the exact 
distribution of d’ for small samples 
has been given (11) when in fact 
F(x) =G(x). A short table for equal 
size samples is also available (11). 
The explicit expression for the dis- 
tribution function of d' has been 
given recently in (6) for equal size 
samples. From the tables on p. 126 
of (11) we find that, say, in the case 
where M=N=15, the probability 
that d/z7/15, when in fact F(x) 
=G(x), is Pr(d'z 7/15) =1—Pr{d' 
<6/15} =1—.925 =.075, and Prid 
28/15} =1—.974=.026. Hence, in 
order to test the null hypothesis that 
F(x) =G(x) at the .075 level of sig- 
nificance, the value of d’ is computed 
and the null hypothesis is rejected 
when d'z7/15. If the test is at the 
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.026 level of significance, the null hy- 
pothesis is rejected when d' 2: 8/15. 

Let us now consider the case where 
F(x) and G(x) may be discontinuous 
cumulative distribution functions. 
The inequalities stated in (7) serve 
to validate the use of the Kolmo- 
gorov-Smirnov statistic in the case 
where F(x) and G(x) may be discon- 
tinuous. From these inequalities, we 
see that when F(x) and G(x) may be 
discontinuous, the error obtained will 
be in the "safe direction" if tables 
are used (11) which assume that F(x) 
and G(x) are continuous. For exam- 
ple, suppose two samples are drawn 
each containing 15 observations (M 
=N=15). In order to test the null 
hypothesis that F(x)=G(x), the 
value of d is computed. If the null 
hypothesis is rejected when d' = 7/15, 
the level of significance will be, at 
most, .075. The test will be at no 
more than the .026 level of signifi- 
cance if the null hypothesis is re- 
jected when d'z 8/15. The tests will 
be free from any restriction concern- 
ing the nature of the functions F(x) 
and G(x). We need not assume that 
the functions F(x) and G(x) are con- 
tinuous in order to obtain tests of the 
hypothesis that F(x) =G(x). 

Let us illustrate the problem of 
testing whether two samples have 
been drawn from the same popula- 
tion. We shall study the agreement 
between the sample of 15 observa- 
tions which was described in the pre- 
ceding section (numbers from 0 to 5 
obtained by using fifteen students as 


TABLE 2 


ABSOLUTE DIFFERENCE BETWEEN THE CUMULATIVE STEP-FUNCTIONS FOR TWO 
EACH CONTAINING 15 OBSERVATIONS 


0 1 
Su(x) 0 1/15 
S^ (x) 4/15 6/15 
| Su(x) —S^u(x)] 4/15 5/15 
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subjects of an inquiry) and a see 
sample of 15 observations. 
second sample is arranged in ord 
increasing size, we obtain 


0,0,0,0, 1, 1, 2, 2, 2, 2, 3, 3, 5,5) 
(The reader may be interested 1 


know that these 15 observations we 
obtained by using fifteen busin 
men as the subjects of the inquiry 
That is, each businessman wa di 
terrogated in the same manner 
students. The businessman was tl 
asked to choose the person who 
be the richest among the six 
listed in the order of his preferet 
The values of the cumulative 
functions S(x) and S'u(x) for 
first and second samples respe 
are given in Table 2. The val 
| Su(x) — S'u(x) | are also given in t 

table. Since the maximum absolute 
difference between Sıs(x) and 
is 7/15, the null hypothesis i 
jected at the .075 level of si 

cance. 


le i 


2. ONE-SIDED TESTS 
2.1. One-Sample Test É 


In Section 1.1 the statement 
made that the Kolmogorov-Smi 
test for goodness of fit may b 
better all-around test, when it is 
plicable, than the chi-square test. | 
an all-around test of goodness 
we mean a test of the null hypothe 
that the observed sample was dra! 
from a completely specified po 
tion without specifying the na 


3 4 
5/15 5/15 9/15 
10/15 | 12/15 12/15 
5/15 7/15 3/15 
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the alternate hypotheses. That is, 
the null hypothesis that the observed 
sample was drawn from a completely 
specified population is tested against 
the alternate hypothesis that it was 
not drawn from that population. In 
some particular problems more spe- 
cific alternate hypotheses may be de- 
sirable. For example, very often we 
want to decide not whether an experi- 
mental group is the same or different 
from the general population, but 
whether the experimental group is 
better than the general population, 
or more adjusted than the general 
population, etc. In such cases, the 
null hypothesis that there is no differ- 
ence would be tested against the 
alternate hypothesis that the experi- 
mental group is better, or more ad- 
justed, etc. 

Let us consider the numerical il- 
lustration presented in Section 1.1 
where the null hypothesis that a stu- 
dent is equally likely to choose any 
one of the six numbers 0, 1, 2, 3, 4, 5 
was tested against the alternate hy- 
pothesis that the student is not 
equally likely to choose any one of 
the six numbers. For this particular 
problem and for the student body 
which was under investigation it 
Seemed reasonable to expect that 
either (a) the six numbers would be 
equally likely or (b) there would be a 
tendency for the students to assign 
the higher numbers. Hence, in this 
Case it is desirable to test the null hy- 
Pothesis that the six numbers were 
equally likely against the alternate 
Ee os there was a tendency 
E ents to assign the higher 
eni us try to make more precise the 
E. dent that “there would be a 

i hare for the students to assign 

mi er numbers.” If the six numbers 
ay equally likely, then the propor- 

"ona the population assigning the 
er 0 would be F(0)=1/6. Also 
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the proportion assigning the number 
0 or 1 would be F(1) 2/6, and the 
proportion assigning numbers no 
larger than 2 would be F(2)-3/6. 
Similarly, F(3) 24/6, F(4) 45/6, and 
F(5)=1. If "there is a tendency to 
assign higher numbers," then the pro- 
portion G(0) of the population assign- 
ing the number 0 would be no more 
than 1/6 (the case where all numbers 
are equally likely). Also the propor- 
tion G(1) assigning the number 0 or 1 
would be no more than 2/6, and the 
proportion G(2) assigning the num- 
bers no larger than 2 would be no 
more than 3/6. Similarly G(3) $4/6, 
G(4) €5/6, and G(5) $1. Hence, the 
statement, "there would be a tend- 
ency for the student to assign higher 
numbers,” may be replaced by the 
statement that the true population 
cumulative distribution function G(x) 
is no more than F(x); that is, G(x) 
< F(x) for all values of x. 

The null hypothesis that the true 
population cumulative distribution 
is, in fact, the specified F(x) is to be 
tested against the alternate hypothe- 
sis that the true population cumula- 
tive G(x) is no more than F(x) (with 
the “less than" relation holding for 
some values of x). This problem may 
be considered the one-sided analog of 
the problem discussed in Section 1.1 
where the null hypothesis that the 
true population cumulative distribu- 
tion is in fact the specified F(x) is 
tested against the alternate hypothe- 
sis that the true population cumula- 
tive is not F(x). Asa measure of how 
far the observed cumulative step- 
function Sy(x) is from the hypotheti- 
cal distribution F(x) we use the maxi- 
mum difference c between Sw(x) and 
F(x). That is, c=maximum, [F(x) 
—Sy(x)], which is the one-sided 
analog of the maximum absolute dif- 
ference d. When c is large, we have 
evidence that F(x) is not the correct 
population cumulative distribution 
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and that the true population cumula- 
tive G(x) is no more than F(x). If 
the correct population cumulative is 
continuous and is in fact F(x), then 
the sampling distribution of c is 
known, and the explicit expression 
for the distribution function of c is 
given by equation 3.0 on p. 593 in 
(3). Using equation 3.0, we find for 
example that when N =15 the chance 
that c will be no more than 5/15 is 


1.5 /1 
Py(5/15) = 1 — 3x) 
j\e 


E jye C RI 
3 15 S»: $3 
Hence, the probability that cz 5/15 
is Pr(c» 5/15] 21—.97 =.03. In 
order to perform a “one-sided test” 
of the null hypothesis that F(x) is the 
correct population distribution using 
a sample of N —15 observations, the 
value of c is computed. The null hy- 
pothesis is rejected when cz 5/15 if 
the test is at the .03 level of signifi- 
cance. 

For the particular numerical illus- 
tration presented in Section 1.1, we 
find that c—5/15. Hence, the null 
hypothesis that a student is equally 
likely to choose any one of the six 
numbers is rejected at the .03 level 
of significance, and the alternate hy- 
pothesis that there is a tendency to 
assign higher numbers is accepted. 

In Section 1.1 a method was given 
for obtaining two-sided confidence 
bands for the true cumulative dis- 
tribution function F(x). This method 
may be modified in order to obtain 
one-sided confidence bands for F(x). 
For example, Fig. 1 gives a two-sided 
94.5 per cent confidence band for 
F(x) obtained from a sample of 15 
observations. We might make the 
one-sided confidence statement that 
the true cumulative distribution F(x) 
lies below the upper limit of the band 


= 97. 


/ 
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presented in Fig. 1. From the re 
presented earlier in this section” 
see that there is at least “97 per o 
confidence" that F(x) lies below tl 
upper limit of the band in Fig. 1. 


2.2. Two-Sample Test 


In this section a one-sided analogi 
the problem discussed in Section 
will be considered. We ish to tes 
the null hypothesis that ‘he cumula 
tive distribution function F(x) f 
the values from the first sample 
equal to the cumulative distributig 
function G(x) for the values from ti 
second sample. In other words, t 
null hypothesis is that F(x) =G@ 
The alternate hypothesis to be co 
sidered is of a more specific natu 
than the alternate hypothesis for 
all-around test presented in Sect 
1.2. We shall be concerned with tl 
alternate hypothesis that F(x) S G(z 
If the alternate hypothesis is, in 
true (with the “less than" relatie 
holding for some values of x), we $8 
that the population values from whi 
the first sample was drawn are St 
chastically larger than the populat 
values from which the second sam) 
was drawn. The importance of su 
alternate hypotheses has bel 
stressed in (8, 9). For example, V 
often we want to decide not wheth 
the experimental group is the sam 
different from the control group 9 
whether the experimental group” 
better than the control group, 
more adjusted than the control grou 
etc. In such cases the null hypothe 
that F(x)=G(x) would be tes 
against the alternate hypothesis t 
F(x)€G(x). For this one-sided à 
log of the problem described in > 
tion 1.2, we use the maximum G^ 
ence c' between the observed cumu 
tive step-functions Sy(x) and 5 Á 
of the first sample of N observat 
and of the second sample of M 4 
servations. That is, c 2 maxi 
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TABLE 3 


CutriCAL VaLvgs or c'N ror THE a: 100 
Per Cent LEVEL or SIGNIFICANCE 


[S'w(x)—Sw(x)] which is the one- 
sided analog of the maximum abso- 
lute difference d’. When c' is large, 
we have evidence that F(x) is not 
equal to G(x) and that the population 
values from which the first sample 
was drawn are stochastically larger 
than the population values from 
which the second sample was drawn. 
If the population cumulative func- 
tions are continuous and in fact F(x) 
G(x), then the limiting distribution 
of ¢ is known (9, 14). If F(x) &G(x), 
we find that the sampling distribu- 
tion of 4(c’)*MN/(M+N) will have 
approximately a chi-square distribu- 
tion with two degrees of freedom 
when M and N are large and M/N is 
not too close to either zero or in- 
finity. Hence, the tables of the chi- 
square distribution may be utilized 
to test the null hypothesis when M 
and N arelarge. When M and N are 
small, the exact distribution of c' may 
be computed by extending the count- 
ing method presented in (11) for the 
two-sided problem. The explicit ex- 
pression for the distribution function 
of c' has been given recently (6) for 
equal-size samples. Table 3 may be 
used to test the null hypothesis at 
either the 20 per cent, 10 per cent, 
per cent, 1 per cent, or 0.1 per cent 

| of significance if M=N. The 
table gives the critical value of c'N at 
€ various levels of significance. For 
a when N=15 we see from 
I 3 that 7 is the critical value of 
at the 5 per cent level of signifi- 
ance. Hence, the null hypothesis is 
“eee at the 5 per cent level of sig- 
‘cance if ^2 7/15. Using the ex- 
P'Icit expression (see [6]) for the dis- 
ution function of c’, we find that 


Pri? = 1/15} = 1 — Pr (e < 7/15} 


-(D/ Gene 


t us reconsider the numerical il- 
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lustration presented in Section 1.2. 
Since c/ =7/15, we reject the null hy- 
pothesis at the .038 level of signifi- 
cance that the population distribu- 
tion of the numbers assigned by the 
students was the same as the popula- 
tion distribution of the numbers as- 
signed by the businessmen and accept 
the alternate hypothesis that the 
numbers assigned by the students 
were stochastically larger than those 
assigned by the businessmen. In 
other words, we accept the alternate 
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hypothesis that there was a greater 
tendency for the students to assign 
higher numbers than the business- 
men. 

It is interesting to note that if ap- 
proximate critical values are com- 
puted using the chi-square (with two 
degrees of freedom) approximation 
the approximate critical values are 
always more than the exact critical 
values (in Table 3) minus 1. Hence, 
the error in using the chi-square ap- 
proximation is always in the “safe 
direction” for the levels of signifi- 
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cance in Table 3 even when the sam- 
ple size is small. In other words, if 
the null hypothesis is rejected using 
the chi-square approximation, it 
would also be rejected if exact com- 
putations had been made. It is also 
interesting to note that only in the 
following cases will the chi-square 
approximation lead to acceptance 
when an exact computation based on 


Table 3 would lead to rejection: © 


a=.001 and N=12, 15, 18, 21, 25, 
29; a=.01 and N —8, 14. 
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In their original article (7) Webb 
and Lemmon stated that the results 
of the over-all F test may not agree 
with results obtained by subsequent 
use of the ¢ test applied to individual 
means when a functional relationship 
exists between the independent and 
dependent variables. They gave hy- 
pothetical examples which, they 
thought, illustrated their point. Pat- 
terson (5) and Diamond (1) have 
both taken exception to the original 
article, but Webb and Lemmon in 
their reply (8) stated that “ . . . most 
of their [Patterson and Diamond's] 
criticisms have not been aimed 
directly at the core of our problem. 
Most of their discussion seems to deal 
with problems of the conventional 
analysis of variance situation, in 
which the means of the groups show 
No trend, but are randomly related 
to each other.... We were con- 
cerned, however, with a fairly com- 
mon experimental design in which 
the means of the groups show a 
definite ordering or trend, with re- 
Spect to some other experimental 
variable.” 

In all four of the articles written by 
the above authors, analysis of vari- 
ance has been discussed as though its 
n cw were to test for significance 
ilterences between means. This is 
ao what analysis of variance 
c pie often used for in psy- 
ip it research, but it has other 
the pes just as important as 
ids ifferences between means. 

- Ol the other uses of analysis of 
^id is that of testing to see 

er there is a significant linear 


or curvilinear regression between the 
independent and dependent variables. 
This is the test which should be used 
with Webb and Lemmon’s data since 
they hypothesize that a functional 
relationship exists. This test may be 
performed by an extension of the con- 
ventional analysis of variance pro- 
cedure. Fisher’s (2) method of orthog- 
onal polynomials is convenient to 
use for this analysis. Snedecor (6, 
Ch. 14, 15) discusses in detail the 
procedures necessary to make this 
analysis, and Johnson and Tsao (3) 
give a very complete discussion of the 
procedures involved in fitting orthog- 
onal polynomials to a 4X7X2X2 
X2 factorial design on determination 
of differential limens. 

The purpose of this note is to com- 
ment briefly on this technique and 
show how its use resolves the seeming 
paradox posed by Webb and Lem- 
mon. The principle underlying this 


procedure is that, when a test of ' 


functional relationship is desired, the 
between SS with k—1 df may be 
analyzed into k—1 orthogonal com- 
parisons, each associated with a par- 
ticular aspect of the regression line. 
Each of these components may be 
tested for significance independently. 
The first component isolated is that 
due to linearity, the second is the 
quadratic component, the third the 
cubic, etc. The df associated with 
each of these is 1, and the individual 
MS'sare tested against the within MS 
for significance. 

Cases I, II, and V from Webb and 
Lemmon's original article will be 
discussed with reference to this 
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method. Cases III and IV have to be 
eliminated since it is necessary, when 
using orthogonal polynomials, either 
to assume that observations are taken 
at equal intervals along the X-axis, 
or else the actual values on the X-axis 
must be known. For Case I there is 
no problem since only two sets of ob- 
servations are taken. In Case II 
Webb and Lemmon assume that 
their C group is midway between 
groups A and B, so equal intervals 
are present here. For Case V inspec- 
tion of the graph seems to indicate 
that the four groups are spaced at 
equal intervals, and that is the as- 
sumption made here. For Cases III 
and IV it is obvious that the groups 
are not spaced at equal intervals, so 
these cannot be discussed. 

To illustrate the procedure and its 
interpretations, the various groups 
were assigned arbitrary summation 
values (the results would have been 
the same no matter what values were 
assigned), and analysis of variance 
tables were constructed which yield 
F ratios identical with those given 
by Webb and Lemmon. For Case I, 
DA was assumed to be 11 and ZB was 
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assumed to be 33. With 11 Ss in each 
group it was simple to determine the 
SS and MS for the between-groups 
variance. Since F=4.35 by Webb and 
Lemmon's definition, it was possible 
to work back and determine what the 
within MS and SS had to be to yield 
this F ratio. The same procedure was 
followed for Cases II and V. For Case 
II, ZA211, 2B=33, and ZC222. 
For Case V, ZA-11, ZB-11, =C 
=33, and ED —33. Then the be- 
tween SS for Case II was analyzed 
into its linear and quadratic com- 
ponents, and the between SS for 
Case V was analyzed into its linear, 
quadratic, and cubic components. 
The MS's were then computed and 
were tested against the within MS. 
All these data are illustrated in Table 
1. The values which are not in paren- 
theses are the between and within 
sources of variance and are identical 
with Webb and Lemmon's data. All 
the data which are enclosed in paren- 
theses are the results of the orthog- 
onal polynomial analysis. y 

When Case II is analyzed in this 
manner, it is seen, though the over- 
all F test is insignificant, that when 


TABLE 1 


ANALYSIS OF VARIANCE OF Cases I, II, AND V FROM WEBB AND LEMMON'S 
4 HYPOTHETICAL Data (7) 
Data not in parentheses are from Webb and Lemmon. Data in parentheses are the results 


of the orthogonal polynomial analysis. 


Case Source SS MS ji 
EL 
I Between 22 22 4.35 
Within 101.20 5.06 
I Between 22 11 2.17 
(Linear) (22) (22) (4.38) 
(Quadratic) (0) (0) 
Within 151.80 5.06 
V Between m 14.67 2.90 
(Linear) (0) (0) 
(Quadratic) 44 (44) (8.70) 
(Cubic) (0) (0) 
Within 202.40 5.06 


EE KK c0 
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the linear and quadratic components 
are separated all the between vari- 
ance is explained by the linear regres- 
sion (which would have to be true 
since Webb and Lemmon assumed 
that the mean of their C group fell 
directly on the grand mean). Since 
there are 1 and 30 df to test the 
linear MS, this linear regression is 
more significant than that for Case I 
where the identical F ratio is found 
but is tested against 1 and 20 df. 
(Case I may also be considered to be 
a test of linearity, since the linear SS 
here is identical with the between 
SS.) This also follows logically since 
we would place more confidence in a 
linear curve determined by 3 points 
than we would in a curve determined 
by only 2 points. 

For Case V it may be seen that a 
highly significant quadratic relation- 
ship exists between the two variables. 
None of the regression is explained by 
the linear or cubic elements; the 
quadratic component accounts for all 
the between variance. Though in 
this case the over-all F is found to be 
barely significant at the .05 level, a 
more accurate interpretation of the 
experiment is obtained from this fur- 
ther analysis. 


This type of statistical analysis 
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would appear to be profitable in psy- 
chological research. As Webb and 
Lemmon indicate, the situation 
where a functional relationship exists 
between the independent and de- 
pendent variables is a fairly common 
one. Kogan (4) in his recent review 
of variance designs in psychological 
research points out that the pro- 
cedure of fitting orthogonal poly- 
nomials will frequently furnish infor- 
mation and answers to questions 
which cannot be obtained by an over- 
all F test to treatment means. How- 
ever, the fact that Kogan only cites 
one psychological study which has 
used this method—that by Johnson 
and Tsao (3)—would indicate that 
this technique has not been widely 
used by psychologists. 

In summary, the author would like 
to point out that a logical analysis of 
the experimental situation must pre- 
cede the selection of a statistical tech- 
nique. In the case where E suspects 
that a functional relationship exists, 
the over-all F test of treatment means 
is not the best test of this relation- 
ship. The use of the method of orthog- 
onal polynomials, however, will per- 
mit E to make an exact test of his 
hypothesis. 
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The problem of evaluating a series 
of statistical tests (e.g., t's, F's, x?'s) 
has recently received the attention of 
psychologists. The general approach 
to the problem is to set the signifi- 
cance level (p) at .05 or .01 and to 
find the chance probability of ob- 
taining at least s significant results. 
This is done by expanding the bi- 
nomial, (p--q)*, where gq=1—p and 
N is the number of significance tests 
made. This procedure is applicable 
when prediction of the outcome of a 
statistical test is independent of any 
other tests in the series (8). Wilkin- 
son (14) has published tables for at 
the .05 and .01 levels showing the 
probability of obtaining m or more 
significant statistics out of N calcu- 
lated statistics. Wilkinson’s tables, 
however, only run to N —25. BroZek 
and Tiede (1) in a subsequent article 
suggested that when N is equal to or 
larger than five it is possible to em- 
ploy the normal curve approximation 
to the binomial distribution. To do 
this, the critical ratio is calculated by 
means of the formula 


[n-M| -.5 


[4 


where M — Np, o-4/Npq, and —.5 
is the correction for discontinuity. 
The probability value is obtained 
from the table of area under one tail 
of the normal curve. To use this 
normal curve approximation, how- 
ever, N must be 100 or larger when 
p is taken as .05, or 500 or larger 
when p is taken as .01, since it will be 
remembered that Np must be equal 
to or larger than five. Between 
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CR 


Wilkinson's tables and the norm 
curve approximation there is à g 
which we feel should be filled in, 

The graphs which we provide 
run to N=100 for p=.05 (Fig. 1) ang 
N=500 for p=.01 (Fig. 2) a T 
be used when the normal cu 
proximation is not applicable. T 
the graphs, the .05 or the.01 I 
confidence is selected and the nui 
of calculated statistics (N) a 
number of significant statistics 
the chosen level of confide 
counted. The chance probabi 
obtaining at least n out of N $ 
tistics can be read off the graph i 
values between .001 and .50. Whi 
been plotted on a logarithmic 
and this fact should be taken 
account in interpolating for val 
N. For example, for n=7, N 
and p=.05 chance probability 
read from Fig. 1 as lying betwe 
and .01. One would conclude tha’ 
is not probable that obtaining: 
significant results out of 60 was € 
to chance alone. On the oth 
there is still the possibility: 
several of the seven signifi 
tistics might have occurred by 
alone. 

The current practice of t: 
critical values of t's, F's, x's à 
.05 and .01 levels of confidence 
the counting of significant sta 
at these levels and the use OF 
binomial distribution the most SU 
ble approach to the problem o! te 
ing the significance of a series € 
tistical tests, However, th 
two types of situations in W 
graphs will be inadequate. 
is one in which the level o 
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Fic. 1. CHANCE PROBABILITY OF OBTAINING AT LEAST # STATISTICS SIGNIFICANT AT THE .05 
LEVEL FROM N CALCULATED STATISTICS 


cance one desires to adopt is not .05 
- 01, but, for example, .10 or .001. 
po are at least three ways of 
andling this situation. The first is 

pon tables of binomial distribu- 
Sed The U. S. National Bureau of 
ie ards (16) has published tables 
: $ ranging from .01 to .50 and N’s 
oe 50, and Romig (12) has pub- 
ished tables for N’s from 50 to 100. 
d rdg method is to calculate the 
ue red binomial distribution directly, 
‘ined a calculating machine and a 
: poen working formula (9, pp. 
E 3). A third method is to use the 

isson distribution as an approxi- 


mation to the binomial distribution 
(4). When Np is taken to be less than 
five (the range within which the 
normal curve approximation is not 
applicable) and p is taken to be not 
larger than .10, the approximation of 
the binomial distribution by the 
Poisson is fairly good even for values 
of N as small as two. We have found 
that with these restrictions the larg- 
est absolute error in calculating cu- 
mulative probabilities is .02, and in the 
critical area of cumulative probabili- 
ties of .10 or less the error is not 
larger than .012. Soper (13), Molina 
(10), and Hartley and Pearson (7) 
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have published tables of Poisson dis- 
tributions, and Dixon and Massey (3) 
include an abridged Poisson distribu- 
tion in the appendix of their book. 

A second type of situation in which 
our graphs will not be adequate is one 
in which exact probabilities are cal- 
culated for a number of significance 
tests and a sensitive test of over-all 
significance of the series is desired. 
Fisher (5) offers a method of combin- 
ing exact probabilities from a series of 
tests of significance. His test is based 
on the formula for the probability of 
a chi square with two degrees of free- 
dom. Using this formula it is possible 
to transform p values to chi-square 
values. Using common logarithms, 


x? = 2-2.302585(— logo 5). 


Independent chi squares and their 
degrees of freedom (two for each chi 
square) can be summed and these 
sums referred to a chi-square table 
for a combined probability value. 
For an example which is worked out, 
interested readers are referred to 
Fisher (5, pp. 99-101) or to the arti- 
cle by Jones and Fiske (8). r 
To use this method exact probabili- 
ties of t's, x?’s, and F's are needed. 
Good approximations of exact proba- 
bilities can be obtained by linear 1- 
terpolation in available tables of t and 
x! by first expressing probabilities 1? 
natural or common logarithms. Since 
Fisher's chi-square technique calls for 
exact probabilities expressed in loga- 
rithms, the values found by interpo- 
lation need not be transíormed 
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their antilogarithm equivalents. The 
- same procedure applies to F, with the 
additional step of transforming F to 
square root of F before making the 
linear interpolation. The usual pub- 
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lished tables of F do not allow for 
interpolations for p values below .05. 
However, Hald (6) in his Statistical 


Tables and Formulas includes F dis- 
tributions for critical p values of .10, 
.30, and .50. Exact p values for F's 
can also be calculated from the in- 
complete beta function tabled by 
Pearson (11), and readers are referred 
to Burke's (2) article explaining this 
procedure, 
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COMMENTS ON SEEMAN'S OPERATIONAL ANALYSIS OF 
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Michigan State College 


While the present writers are in 
wholehearted agreement with Dr. 
Seeman's remark that formal opera- 
tional analysis is an indispensable pre- 
requisite to empirical investigation, 
it is also true that the results of the 
analysis, once it is completed, need to 
be criticized. After one has ab- 
stracted the formal structure of a 
theory, it is necessary to subject this 
formal structure to logical criticism. 

The first criticism is concerned 
with the restatement of the Freudian 
assertion that daydreams are wish- 
fulfillments. The actual restatement 
is as follows: ‘‘The emission of a day- 
dream is functionally related to a. spe- 
cific type of demand (wish), the rela- 
tion being such that whenever an in- 
stance of such and such a daydream is 
observed, it is required by the theory 
that an instance of a specified corre- 
sponding demand (wish) must be iden- 
tified by a suitable objective operation" 
(2, p. 377). Dr. Seeman then goes on 
to say: “It seems clear that, so 
stated, the theory really requires the 
occurrence of identifiable, lawful pat- 
terns of demand-daydream covaria- 
tion.” The point to be made here is 
that this restatement is not a state- 
ment in theory at all; it is, rather, 
metatheory. That is to say, the re- 
statement is a statement about 
theory, not a statement im theory (4). 
If the quoted statement were actually 
theory, it would be an assertion of a 
specific functional relation. There are 
many different functional relations 
which fit the description given in the 
quoted statement. 

Second, the theory does not require 
“the occurrence of identifiable, law- 
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ful patterns of demand-daydream & 
variation." The theory is the set 
statements which assert "identifi 
ble, lawful patterns of demam 
daydream covariation." Every e 
pirical theory consists of statemen 
which assert identifiable, lawful p 
terns of construct phenomena coval 
ation. This is required of the theor 
not by the theory on purely metho 
logical considerations. 
Third, on page 377, Dr. 
asserts: “What is crucially impot 
here is the understanding of the cø 
tingent notion of frequency, which li 
buried in this analysis of the , 
of the concept of wish-fulfillme 
Then on page 379, he asserts that @ 
the hypothesis that is predicte! 
the theory—is a predicate. NO! 
predicate, with its argument, 
two-valued constant (not a n 
valued functor); it is either true 
is false. It is not possible, wi hu 
predicate Q analyzed as it is, to 
dict anything about the frequency 
daydreams. K 
It is well to add in passing tha 
Seeman is not the only psycho 
who has confused statements 
theory for statements in th 
this respect. There is, as it 
great deal of precedent in moe 
psychological “theorizing” for call 
statements of this sort theory. 
The next point for considet 
concerns the ‘inductive leap 


statement is "an extremely e 
tary application of symbolic 
This is not the case. The sy 
statement is an extremely ele’ 
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operation excluded in symbolic logic 
—known as the fallacy of asserting the 
consequence (1, pp. 7-8). 

The reasons that we wish to ex- 
clude formal fallacies from our the- 
ories may be summed up as follows: 
A fallacy is a false form of statement. 
It is a well-known theorem in sym- 
bolic logic that a false statement im- 
plies any statement (3, p. 104). Now, 
it is the case that statements assert- 
ing both sides of every question about 
which any theory makes an assertion 
are included in the class of all state- 
ments (the class including every 
statement). Therefore, the theory 
which contains even one fallacy will 
predict both sides of every question 
about which it contains an assertion. 
There are three results of this state of 
affairs which are worth mentioning: 
(a) The theory is never wrong—it 
always predicts what is in fact the 
case, along with what is not the case. 
(b) The theory makes no unequivocal 
assertion, and the theorist must al: 
ways wait until after the fact to find 
out what his theory would predict. 
Thus, prediction is always of an ad 
hoc nature. (c) The theory is useless. 
Any procedure which purports to be 
the result of the use of the theory 
could proceed just as well without it. 

The next point for consideration 
concerns the discussion of the sen- 
tence [(P. P^,...,) 5 Q..]- [Q]. 

, On page 379 Dr. Seeman states 

-.. in those isolated instances . . . 
Where the confirmation conditions 
rote that ~Q is the case, the in- 
cation would be for a re-examina- 
tion of P' before P." With this point 
of view we disagree. Whenever a 
Statement derived from an empirical 
ed is disconfirmed the theory it- 
ad is aaa The only way out is to 
^ at the theory does not predict 
t e state of affairs that was discon- 
rmed. 


With the help of symbolic logic it is 


177 


easy to show that it is P that is false 
when one discovers a disconfirmation 
of the theory. There are two cases, 
namely: (a) P’ asserts a fact; (b) P' 
asserts an assumption (2, p. 379). 
Consider the first case: If it were 
true that P’ asserts a fact, then P’ 
cannot be false. After all, if P’ 
asserts an actual state of affairs, it 
can only be true. The theory asserts 
that (P-P') implies Q. We find ~Q; 
therefore, by Modus Tollens, we de- 
duce ~(P-P’). But P’ is true; 
therefore, P must be false (1, p. 25). 

Consider the second case: P' as- 
serts an assumption; i.e., P’ is an in- 
stance of P. The logical model for 
this situation is the reference formula 
known as Specification (Spec., see 1, 
p. 354). Applied to our situation 
Spec. asserts that P is a universal 
assertion that implies P’. It goes al- 
most without saying that the uni- 
verse of discourse here is the circum- 
scribed universe in which the theory 
under consideration applies. Since P 
implies P', and we have ~P', we 
may, by the reference formula known 
as Modus Tollens, deduce ~P. 

It is thus seen that the occurrence 
of a nonconfirmation (~Q) leads in 
every case to the denial of the theory. 
There is no way to save a theory if it 
actually predicts wrong; one can only 
change it. 

The present writers hope that the 
reader will not feel that they disap- 
prove of what Dr. Seeman has tried 
to do. On the contrary, it is felt that 
Dr. Seeman has accomplished two 
important things: (a) He has pro- 
vided the first (to our knowledge) 
attempt to demonstrate the logical 
form of the Freudian theory of day- 
dreams. (b) He has stated the results 
of his analysis in such a way that one 
can unequivocally determine its logi- 
cal characteristics. 

The present comments will be 
closed with a few words about the 
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implication of Dr. Seeman's analysis 
for the Freudian theory of day- 
dreams. If Dr. Seeman's analysis is 
correct, then all of the remarks which 
were made earlier, with reference to 
theories which contained fallacies, 
are applicable to the Freudian theory 
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of daydreams. These remarks were: 
(a) The “theory” predicts both sides 
of every question. (b) The "theory" 
is of an ad hoc nature. (c) The “the- 
ory” is useless; any procedure which 
purports to follow from the “theory” 
could proceed just as well without it. 


REFERENCES 


1. CooLey, J. C. A primer of formal logic. 
New York: Macmillan, 1949. 

2. SEEMAN, W. The Freudian theory of day- 
dreams: an operational analysis. Psy- 
chol. Bull., 1951, 5, 369-382. 


3. WHITEHEAD, A. N., & RussELL, B. Prin- 


PSYCHOLOGICAL B: 
Vol. 51, No. 2, 1954 


cipia mathematica. Vol. 1. Cambridge: 
Cambridge Univer. Press, 1925. 


4. Woopcer, J. H. The technique of theory 


construction. Int. Encycl. unified Scis 


1939, 2, No. 5. 
Received March 16, 1953. 


REPLY TO THE BEHANS 


WILLIAM SEEMAN 
Mayo Clinic 


In connection with the Behan 
paper, I shall confine myself to a few 
brief comments. 

1. The Behans' doctrinaire state- 
ments about theory suggest a con- 
fusion of logical fact with methodo- 
logical decision, especially with the 
notion of what Reichenbach calls 
“entailed decisions" (5, pp. 11-15). 
The impression conveyed in their 
paper that their remarks on theory 
are in accord with Woodger (6) is also 
erroneous. Actually, they are in dis- 
agreement with Woodger; and the 
"specimen theory" he presents (and 
which he therefore, presumably, con- 
siders acceptable as "theory'") would 
be ruled out by the criteria set out in 
the Behan paper. 

2. It is not true that I have com- 
mitted the fallacy of asserting the 
consequent. Given "If 'P' then 'Q' 
and assert 'Q' " it would be asserting 
the consequent if and only if I were 
to say "assert ‘P’ to be true on the 
basis of ‘Q.’ " But my paper does not 
say that; what it says specifically is 
“inductive leap to ‘P, " and this on 


the assumption that it would be un- 
derstood as a convenient shorthand 
for something like "accept ‘P’ pro- 
visionally as a proposition in the 
larger theory until such time (if any) 
that there is sufficient evidence to 
controvert it and render it useless in 
the theory." To call this ‘asserting 
the consequent” is to say either that 
all experimental procedure leads to 
this fallacy or to assert a stricture 
against all empirical science. And 
this, I presume, is what MacCorquo- 
dale and Meehl have in mind when 
they write “All scientific hypothesiz- 
ing is in the invalid ‘third figure’ of 
the implicative syllogism” (4, p. 97)- 
What purpose, after all, can an ex- 
periment serve if, from the results, 
one is forbidden to make inferences? 

3. With respect to the deductions 
which the Behans characterize as 
"impossible," the most effective refu- 
tation, it seems to me, lies in the sim- 
ple fact that the deductions have 
been done. To carry out an expert 
mental test of this I used a group 9 
eight professional logicians and st 
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experimental psychologists. They 
are in agreement in performing the 
"impossible" deductions. 

4. The Behans obviously have con- 
fused material implication with logi- 
cal implication! 

5. Their use of the formula Spec. is 
erroneous. This formula is not in the 
sentential calculus (e.g., PDQ), but 
in the logic of quantifiers, e.g., 
x(Fx 2Gx); on this, see Quine (3, pp- 
17-18, ch. 2). 

6. The best comments on the 
"proof" of the consequences of non- 
confirmation for any theory are to be 
found in Ayer and Cohen and Nagel. 

7. The Behan paper is guilty of in- 
valid inference. Specifically, in the 
final paragraph their conclusion 
would be formally valid if and only if 
there were a true additional premise 
"and everything we have said is fac- 
tually true and formally correct." 


! This confusion is evident in their equating 
material implication. with prediction, after 
stating the “well-known theorem." It is not 
true that a false statement logically implies 
any statement, and hence it does not predict 
at all. The "well-known theorem” is a 
theorem in the calculus of propositions and it 
states that a false statement materially implies 
any statement. To equate this with predic- 
tion leads to absurdities which I shall demon- 
strate ina moment. The distinction between 
these two kinds of implication is given by 
Cohen and Nagel somewhat as follows: 
Logical implication “involves no assumption 
às to the factual truth or falsehood of either 
of two propositions, but only that they are so 
UAM by virtue of their structure... 

at it is impossible for the implicating 
Pi dodge to be true and the implied one to 

false" (2, p. 127). It is quite different with 


material implication, which is “the name we 
give to the fact that one of a pair of proposi- 
tions happens to be false or else the other hap- 
pens to be true" (2, p. 128). Of material im- 
plication Quine says: “This relation is so 
broad as not to deserve the name implication 
s." (8, p. 29). 

The most effective way of demonstrating 
the consequences of this confusion is to exhibit 
its absurd results. Allowing the "well-known 
theorem” to formulate a prediction would lead 
to permitting a statement like the following: 
“Two plus two equals five predicts that Sacco 
and Vanzetti were executed for murder and 
Alfred Smith was defeated for the presidency 
in 1928 predicts that the base angles of an 
isosceles triangle are equal” (2, p. 127). Read- 
ers conversant with symbolic logic will have 
noticed that this is the same error which I 
committed in my 1951 paper. Had the 
Behans singled this out as a major error and 
confusion I should have had no choice but 
agreement. A second error was my failure to 
realize that the sentential calculus does not 
provide the resources for the formulation of a 
psychological theory (3, pp. 17-18, 65-71). 
This same error is also perpetuated in the 
Behan paper, which accepts the sentential 
calculus as the basis for argument. 

2 On this point Ayer and Cohen and Nagel 
are quite explicit. Ayer writes that, in the 
case of a nonconfirming event, "we may con- 
clude that the [theory] is invalidated by our 
experiment. But we are mol obliged to adopt 
this conclusion [italics mine]. If we wish to 
preserve our [theory] we may do so by aban- 
doning one or more of the other relevant hy- 
potheses" (1, p. 94). Cohen and Nagel take 
the same position: “The logic of the crucial 
experiment, therefore, is as follows: If Hi 
(theory) and K (assumptions) then i. But 
p is false; therefore either Hy is false or K (in 
part or completely) is false" (2, p. 220). 
Readers who may wonder how it was so 
"easy" to demonstrate the reverse of this 
“with the help of symbolic logic” will find the 
answer in the previous paragraph; i.e., the 
Behans applied a formula from the calculus of 
quantifiers to a case in the sentential calculus. 
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AN EVALUATION OF THE ANNUAL REVIEW OF 
PSYCHOLOGY (VOLUMES I-IV) 
LYLE H. LANIER 
University of Illinoi 


One consequence of the postwar ex- 
pansion of professional psychology— 
as science, as education, and as prac- 
tice—has been an increasing volume 
of literature in all its branches. Beset 
by growing scientific specialization 
and professional diversification, psy- 
chologists individually have had de- 
creasing time and competence to give 
to the assimilation of this profusion 
of publication. The important func- 
tions of review, evaluation, and syste- 
matization have lagged seriously be- 
hind the accumulation of articles, 
monographs, and books. Whether or 
not this material really represents a 
single universe of scientific discourse 
or can be transformed into a unified 
conceptual structure have been ques- 
tions of mounting concern to psy- 
chologists. 

Into this state of growing confusion 
the Annual Review of Psychology was 
introduced in 1950. It “was con- 
ceived as a supplement to, rather 
than a duplication of, other publica- 
tions presenting abstracts or rela- 
tively long-term reviews of psycho- 
logical literature . . . to present crit- 
ical appraisals of current research 
and theory in psychology on an an- 
nual basis in the case of the most 
active and general fields and on a bi- 


1 SroNE, CALVIN P., & Taytor, DoNALD W. 
(Eds. Annual review of psychology. (4 vols.) 
Stanford, Calif.: Annual Reviews, Inc., 1950, 
1951, 1952, 1953. Two of these volumes have 
been previously reviewed in this JouRNAL: 
Vol. 1 by H. H. Kendler in 1951 (pp. 159-161) 
and Vol. 3 by M. H. Marx in 1952 (pp. 657- 
660), 
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ennial basis for fields of lesser activity 
or scope." Four volumes have now 
appeared, edited by Calvin P. Stone 
with Donald W. Taylor as associate 
editor. The purpose of the present 
review is to evaluate this series as a 
whole, and to comment in particular 
upon Vol. 4. 


GENERAL APPRAISAL OF THE SERIES 


Although the Annual Review is 
widely known among psychologists, 
there are probably many who do not 
know in detail what the series has 
contained and who the contributors 
have been. Both for this reason and 
for its uses as a framework for the 
present review, a summary of infor- 
mation concerning all four volumes 
is presented in Table 1. 

Size and bibliographic scope. The 
Annual Review has grown from 330 
pages in 1950 (including indices) to 
485 pages in 1953—an increase of 
about 47 per cent. This increase in 
length has roughly paralleled the 
number of references cited. In 1950 
there were 1,594 titles in the eighteen 
chapters, while in 1953 the number 
was 2,003 for nineteen chapters. The 
gross increase was 38 per cent, while 
the average increase per chapter was 
about 25 per cent (from 89 to 111). 

There are interesting differences 
among the authors in the number of 
references cited, both for different 
reviews of the same topic and for 
reviews of different topics. For exam- 
ple, Sears in 1950 listed only 39 
articles on personality, while Bron- 


contrast, 


‘nual Review. 
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TABLE 1 
Summary or Contents OF Annual Review of Psychology 
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Volume 1 Volume 2 Volume 3 Volume 4 
Author Pp. Refs. Author Pp. Refs. Author Pp, Refs. Author Pp, Refs. 
Jones & 8 38 Barker 22 69 Nowlis& 28 136 Harris 30 100 
Bayley Nowlis 
Melton 22 101  Buxton 22 290 Harlow 26 121 Underwood 28 103 
Bartlett 18 169  Chapanis 20 96 Helen 30 120 Vernon 30 154 
ewman 22 105 ever 14 94 Garner 20 587 Licklider 22 92 
Geldard 16 62 Pfaffmann 16 88 Wendt 26 172 Ruch 26 ól 
Thorndike 18 89 Tyler 18 91 Humphreys 20 79 Anastasi 20 139 
rs 4 3 acKin- 24 73 Eysenck 24 104 Bronfen- 26 111 
non brenner 
Bruner 32 101 Katz 36 101 Smith 30 113  Newcomb 32 117 
Shartle 22 128 Bellows 20 98 Brown ms 28 126  Harrell 24 140 
Hebb 16 67 Deese & 24 136 Nissen& 28 157 iee 16 71 
Morgan Semmes 18 107 
Cameron 18 142 Taub 22 216  Zubin 22 68 White 22 15 
~ x ARA Ly 109 E 2 i Raimy E] 159 Sanford 2 " 
nyder 
Sronnasn rd E Es n) Elmgren 28 104 Carter 20 129 
ie 
Bordin 10 43 Pepinsky 18 so Gibert 30 120 Williamson 18 99 
Grant 20 113 wards 18 67  McNemar 10 33 Mosteller 28 84 
Johnson 14 39 
Shock 18 146 
Mowrr 20 SS 
Cobb 26 138 
Bergmann 23 40 


fenbrenner cited 111 in 1953. By 
Cameron mentioned 142 
references in his 1950 discussion of 
abnormalities of behavior, while 
White listed 75 in 1953. Such varia- 
tions are, of course, very difficult to 
interpret since authors differ greatly 


_ in the extent to which evaluation or 


even individual mention is made of 
numbered references. 

The number of references cited in 
Vol. 4 is roughly 25 per cent of the 
number of entries in the Psychologi- 
CallAbstracts for 1952. Thezsignifi- 
Cance of this percentage is by no 
means clear, however, since the Psy- 
Chological Abstracts cites many in- 

rmational and “professional” con- 
tributions which would not properly 
Come within the purview of the An- 
3 It also includes many 
articles in related fields which would 
not normally be included in an an- 
nual review of psychology. Moreover, 
the undetermined amount of dupli- 


Cation among the references in the 


Annual Review would inflate this 
E index of “bibliographic cover- 


It is of some interest to compare 
quantity of literature reviewed in the 
Psychological Bulletin with that 
covered in the Annual Review. In 
1950 some 1,363 titles were cited in 
articles and book reviews in the Psy- 
chological Bulletin, while the Annual 
Review listed 1,594 references. In 
1953 the comparable figure for the 
Bulletin was 1,322 while that for the 
Annual Review had increased to 
2,003. These numbers are not, of 
course, commensurable in any strict 
sense, partly because of the differing 
objectives of the two publications and - 
the undetermined amount of duplica- 
tion among the references in each of 
them. 

Topical organization. lt is to the 
chapters concerned with “basic” psy- 
chological science that one turns for 
the Annual Review's "systematic 
psychology"—its conceptual organi- 
zation of the scientific subject matter 
of the discipline. Such chapters com- 
prise approximately two-thirds of 
each volume. The topics fall into 
quite heterogeneous categories. Cer- 
tain chapters are focused upon classes 
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of "primary" dependent variables 
as subject matter—general modes of 
behavior such as seeing, hearing, 
learning, and problem solving. An- 
other set of chapters is organized in 
terms of “secondary” dependent vari- 
ables such as indices of abilities, per- 
sonality characteristics, and patterns 
of social behavior. A third set of 
chapters is concerned with different 
types of behaving subjects—animals, 
children, the aged, the disabled, the 
abnormal. The material on physio- 
logical psychology is partly a mis- 
cellany, but its distinctive "syste- 
matic" concern would seem to be 
with the influence of a set of inde- 
pendent variables upon a wide variety 
of behavioral functions. In this re- 
spect it is similar to those aspects of 
social psychology which consider the 
effects of social and cultural condi- 
tions upon the behavior variables 
serving as systematic categories for 
other levels of psychological analy- 
sis. 

By and large, this multidimension- 
ality of “chapter headings” is typical 
of the general state of contemporary 
systematic psychology—a state of 
essentially — "logic-tight" compart- 
ments insofar as formal conceptual 
relations go. The editors of the 
Annual Review take the field pretty 
much as they find it, and apparently 
make little direct attempt to change 
it towards greater logical or psycho- 
logical order. This judgment is not 
necessarily a criticism of a publica- 
tion such as the Annual Review. It 
would be difficult for the editors to 
do otherwise than present psychology 
in terms of the "chapter headings," 
the clusters of scientists, and the 
classes of institutions which underlie 
the confused mosaic of contemporary 
psychology. Scientific writers prob- 
ably work more easily (and willing- 
ly!) on tasks defined in terms of famil- 
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iar concepts and preferred categories 
A related consideration is the 
that the literature tends to be g 
ganized in these terms (for examp 
in the Psychological Abstracts) 
hence is more easily surveyed th 
would be the case if an idiosyne 

framework for systematic e 
tion were established by editorial fiat 
Finally, practical considerations 
make it virtually impossible for the 
editors to achieve any kind of ex 
post facto editorial coordination or 
integration among the several co 
tributions. There is insufficient 
between the submission of the m 
scripts by the authors and the publi- 
cation of the volume to accomplish 


emendation of the material. 
various chapters cover the literati 
up to within six to seven months 
the publication date of a given 1 
ume. To complete the writing, 
ing, and printing within such a pe 
is a remarkable achievement in 
Nevertheless, it is unfortunate: 
that certain focal points of conte 
porary psychological discussion a 
largely lost within the present topi 
structure of the Annual Review. P 
ception and motivation are € 
ples. Volume 3 has a chapter on mo 
vation, but none of the four volum 
has had one on perception. Most 
the important literature in these areas 
has probably been reviewed under 
other headings, but there have not 
been the continuing, unified co 

mentaries upon the work in th 
fields which their importance in conc. 
temporary psychological thinkin 
tifies. 


not be transformed into any 
resembling a unified body of kno! 
edge. But it is precisely because 
the confusion as to the meaning ê 
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systematic status of such terms that 
continuing efforts at critical method- 
ological appraisal of their usage in 
research and theory would be valu- 
able. 

More extensive, independent treat- 
ment of such topics as perception, 
motivation, and symbolic behavior 
would add, of course, to the length of 
a volume of the Annual Review. If 
compensating reductions had to be 
made they might well affect the “psy- 
chophysiological'" literature. Each 
of the four volumes has had three 
chapters devoted to sensory processes, 
whereas for perception, motivation, 
and thinking only the last two have 
appeared once each in single volumes 
(see Table 1). Together with special 
treatment of “physiological psychol- 
ogy,” this pattern of differential em- 
phasis overstresses psychophysiologi- 
cal literature at the expense of strictly 
“psychological” material. The physi- 
ological bias is most pronounced in 
the chapters on somesthesis and the 
chemical senses. A check of the 
publication media represented in the 
bibliographies of two of these four 
chapters (Vols. 2 and 4) shows that 
approximately 80 per cent of the 
references are to be found in physi- 
ological and medical publications. 
Less frequent review of this literature 
would make space available for more 
adequate consideration of topics of 
greater significance to psychology as 
à whole. 

The reviewers of Vols. 1 (Kendler) 
and 3 (Marx) in this JOURNAL 
thought that the several chapters de- 
voted to clinical psychology and 
counseling should have been con- 
densed and integrated. A single 
an for psychodiagnosis and an- 

er integrated chapter for psycho- 
f erapy has been the usual proposal 
or condensing these reviews. There 
certainly is overlap among the prob- 


lems and techniques used, respec- 
tively, in "clinical psychology" and 
in "counseling." Indeed, this is a 
prominent subject of discussion by 
several of the reviewers, mainly those 
in the counseling area who are trying 
to find a distinctive definition of their 
field. But there are significant dif- 
ferences between these two fields in 
terms of literature cited in the Annual 
Review. For example, Gilbert's single 
chapter on counseling methods in 
Vol. 3 lists only 23 of the 196 refer- 
ences in Magaret's chapter on "'clini- 
cal" psychodiagnostics and only 11 
of the 79 references in Raimy's chap- 
ter on psychotherapy. The compa- 
rable figures for Vol. 4 show even less 
overlap: Williamson's "counseling" 
chapter includes only 17 of the 115 
references cited in Rotter's review of 
“clinical” psychodiagnostics, and 
only two of Sanford’s 68 references 
in the chapter on "'clinical" therapy. 
To a considerable extent, of course, 
the degree of relative bibliographic 
independence may have been due to 
deliberate effort to avoid unneces- 
sary duplication. Nevertheless, these 
figures, together with substantive dif- 
ferences between the parallel treat- 
ments, suggest that “clinical” and 
"counseling" psychologists at pres- 
ent have less over-all professional 
communality than the formal re- 
semblances of their methodologies 
and underlying principles would seem 
to indicate. It might be added that 
in Vol. 5 the editors plan to have a 
single chapter on theory and tech- 
niques of assessment, one on psycho- 
therapy and one on counseling meth- 
ods. This arrangement is a step in 
the direction of "integration" and 
yet it will allow for special considera- 
tion of whatever distinctive problems 
and techniques there might be in 
the nonclinical aspects of counseling. 

Evaluative character. The pattern of 
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topical organization provides only the 
structural framework for the ap- 
praisal of the literature. Other im- 
portant determiners of the success of 
an annual review are the selection of 
contributors, the formulation of the 
reviewing task, and constructive ac- 
ceptance by the reviewers of the task 
defined. Concerning the contribu- 
tors, the list of authors in Table 1 is 
evidence that the editors have in gen- 
eral succeeded in securing psycholo- 
gists from whom outstanding evalua- 
tions of the literature would be ex- 
pected. The pessimists who pre- 
dicted that the Annual Review would 
fail to recruit a continuing force of 
first-rate contributors were wrong— 
so far as these four volumes go. 

With respect to definition of the 
task, the preface to Vol. 1 states: 
“They [the authors] were at liberty 
to indicate their points of departure 
by citing summaries of basic work 
prior to the review period but were 
advised not to strive for mere com- 
prehensiveness, especially in the form 
of a loosely integrated series of ab- 
stracts of all the current literature. 
On the contrary, the editorial board 
asked that they adopt an interpreta- 
tive and evaluative approach to the 
literature they selected for review.” 
Thus the editors clearly foresaw the 
danger that the reviews might de- 
generate into unorganized biblio- 
graphic digests. They appeared to 
recognize also that certain of the re- 
views in Vol. 1 had failed to avoid 
that danger, since they remark fur- 
ther: “While it appears that all fields 
are not equally amenable to this ap- 
proach, it is hoped that future vol- 
umes of the Review will reflect gains 
in the techniques of exposition and 
interpretation of the desired kind." 

To what extent have the authors 
been able to achieve the general ob- 
jectives set for them by the editorial 
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board? Have successive volum 
shown improvement in the “ 
niques of exposition and interpi 
tion?" Are there persistent differ 
ences among fields in the "evaluatiw 
character" of the reviews? In th 
attempt to answer these questions, T 
devised a procedure for appraisin 
individual chapters in term: of 
sets of presumably desiraule 
acteristics: (a) an introductory oi 
tation which defines the field 
establishes a general methodolog 
setting for the review; (b) exp 
indication of the author's “s 
matic" approach to the field, inclut 
ing the rationale for his topical of 
ganization and bases of selection 
among references; (c) interpretatiy 
and evaluative comments upon th 
literature, including a summary 
assessment of over-all achievem 
and trends (e.g., in methodolog 
in empirical knowledge, in theory 
technology—as appropriate). In 
effort to simplify the difficult j 
mental task, the appraisals we 
stricted to "factual" estimates d 
whether or not (for the first two 
teria) and to what extent (for 
third) the chapters exhibited 
characteristics. As far as p 
differences in quality of writin 
technical sophistication, and in 
ity of argument were disregardet 
The results of these apprai 
represented by the figures in Tab 
In general, it appears that abou 
third of the 63 chapters commo 
the four volumes lack the ki 
“orientational” ^ introduction 
scribed above. More than 
them omit any explicit disc 
the reviewer's “systematic” ana 
or organization of the field. E 
fourth have little or no interp 
tion or evaluation of the literatur 
viewed. These generalization 
subject, of course, to whatever 4 
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TABLE 2 


RescvLTS OF RATINGS OF CHAPTERS FOR 
“ORIENTATION,” “SYSTEMATIZATION,” 
AND “INTERPRETATION-EVALUATION"® 


Volume 
Classification 


No. of chapters with- 
out "orientation" 8 6 5 3 
No, of chapters with- 
out "systematic" 
rationale 10 12 9 9 
Extent of interpreta- 
tion and evaluationt 
Median rating X. 4 
No. of chapters 
rated 1 or 2 5 6 2 3 
—ÉÁX 00 


* The chapters common to all four volumes were in- 
cluded, as well as those condensed or expanded in the 
last two volumes (see Table 1). 

„$ The chapters were rated ona 7-point scale in which 

1" represented the minimal and “7” the maximal de- 
ree of “interpretation-evaluation.” 


5.5 4.8 


fications might be required by the 
unreliability of the method and by 
Possible idiosyncrasies in this single 
reader's standards of judgment. 
. The editors' hope for improvement 
in future volumes—expressed in Vol. 
l—appears to have been fulfilled, if 
the figures in Table 2 can be accepted 
as evidence, Clearly, Vols. 3 and 4 
excel the first two in estimated degree 
of interpretation and evaluation, with 
Vol. 3 Standing highest and Vol. 2 
lowest in this respect. Two mitigat- 
ing circumstances in behalf of Vols. 
land 2 should perhaps be noted: (a) 
the writers of the later volumes could 
eet by seeing the earlier ones and 
rom critical reactions to them on the 
c of the editors and other critics; 
n the last two volumes are consider- 
ably longer, thus presumably making 
ae Space available for evaluative 
à mmentary, The weight of these 
Onsiderations is reduced, however, 
ze fact that individual reviews in 
ae s. 1 and 2 are rated just as high 
those in Vols, 3 and 4, The incli- 
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nation and critical capability of the 
individual author are probably the 
principal determiners of the vari- 
ance among reviews in evaluative 
character. 

There might, nevertheless, be dif- 
ferences among fields in "amenabil- 
ity” to the kind of reviewing in ques- 
tion, as the editors have suggested, 
Although not conclusive evidence for 
such a question, my individual rat- 
ings show that the reviews of the 
applied fields are less "systematic" 
and “evaluative” than those in the 
strictly scientific areas. Again, how- 
ever, the intragroup differences are 
much greater than the average dif- 
ferences between the groups. Cer- 
tain of the reviews in clinical psy- 
chology stand as high by our criteria 
as those in any other field, in contrast 
to the reviews of industrial psychol- 
ogy. With such a broad range of 
application of techniques from so 
many branches of psychology, the 
literature in the latter field is quite 
heterogeneous, and many of the 
studies are highly “atomistic” in 
nature. Integrative evaluation and 
systematization are obviously easier 
to accomplish in an applied field such 
as clinical psychology where the ma- 
terial is organized more definitely in 
relation to general principles and 
theory. 


VoLuME 4—1953 


With its nineteen chapters and 485 
pages (including indices), this largest 
member of the Annual Review family 
presents one innovation which will be 
welcomed by most psychologists: the 
inclusion in several of the bibliogra- 
phies of the full titles of all references. 
This option was apparently avail- 
able to all authors but was exercised 
by only seven of them. (It is dis- — 
appointing to find that the chapter 
on special disabilities follows the 
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Annual Review's original practice of 
numbering references in the order of 
citation in the text.) 

There are two ''special" reviews: 
a chapter by Bergmann on theoreti- 
cal psychology and one by Cobb on 
special disabilities. Otherwise, the 
organization of topics remains es- 
sentially what it was in earlier vol- 
umes (see Table 1). There are sepa- 
rate chapters on comparative and on 
physiological psychology in place of 
the former combined chapter, and 
counseling continues to have a single 
chapter as in Vol. 3. In general, 
Vol. 4 is a very impressive addition 
to the evaluative literature of psy- 
chology. 

Child psychology. In contrast to 
Barker's soul-searching analysis of 
the uncertain and declining state of 
this field in Vol. 2, Harris presents a 
picture of buoyant empiricism, docu- 
mented by the longest bibliography 
in the book. With so much to write 
about, he doesn't worry over the 
difficulty of defining the field rigor- 
ously—in contrast to previous re- 
viewers. He is mainly concerned to 
report upon the 160 studies in the 
bibliography, in the form of brief 
summaries of method and results. 
There is little evaluation or system- 
atic interpretation, and the organ- 
ization of the material is poor. 

Learning. Underwood's review 
continues the generally high level of 
previous reviews in this field—par- 
ticularly with respect to careful in- 
terpretation and evaluation of signifi- 
cant studies. He excludes from con- 
sideration the host of investigations 
in which learning is merely involved 
or demonstrated, restricting the re- 
view to "work in areas with estab- 
lished methodology and some theoret- 
ical orientation." This means a fair- 
ly heavy concentration upon animal 
learning and human conditioning, an 
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unfortunate necessity arising from the 
state of the field and which Under- 
wood probably deplores as much as 
anyone. Research on thinking, he 
notes, "continues to move at an ap- 
pallingly slow pace," although *'there 
are rumblings which portend better 
things to come.” 

Little attention is given to ques- 
tions of over-all systematization of 
the field of learning, either by way of 
introductory orientation or in the 
main body of the review. Perhaps 
conceptual cross references and inte- 
gration are not possible for most of 
these presently somewhat discrete 
problems, but continuing attention 
should perhaps be given to these 
desiderata on the part of reviewers in 
the Annual Review and in similar 
publications. 

Sensory processes. Vernon's review 
of visual literature is the only one of 
the three chapters on the “senses” 
which presents an organizing frame- 
work for the literature to be reviewed. 
Licklider for hearing and Ruch for 
somesthetic and chemical senses get 
under way merely with brief indica- 
tions of what their respective re- 
views will emphasize. Once past the 
introduction, however, Vernon con- 
tents herself pretty much with de- 
scriptive reporting of a wide range of 
literature (extending from simple 
threshold phenomena to the "New 
Look" studies of motivational fac- 
tors in perception). Licklider, on the 
other hand, writes a highly lucid 
interpretative evaluation of the audi- 
tory literature. It appears to am 
outsider to be a balanced account 0 
the entire spectrum of research in 
this field, insofar as a single years 
output yields such a product. Ruch’s 
chapter is restricted almost entirely 
to research on physiological mecha- 
nisms underlying somesthesis, WY 
only about three pages on the “chem! 
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cal senses." It is a well-written re- 
view, somewhat better in respect of 
interpretation than of evaluation. 

Comparative and physiological psy- 
chology. Hess remarks about com- 
parative psychology that it is “prob- 
ably high time we stop deploring the 
lack of an orienting theoretical ap- 
proach.” He finds one in the work of 
European investigators, particularly 
Tinbergen and Lorenz. With this 
general orientation he proceeds to 
give a good account of a relatively 
small number of studies. Other com- 
parative psychologists, notably 
Schneirla, will probably disagree with 
Hess's view that this recent Euro- 
pean work is the needed "'shot in the 
arm" for their somewhat neglected 
field. 

Neff doesn't discuss the systematic 
status of physiological psychology, 
although he commences his review 
with „two brief paragraphs which 
mention the problems of major cur- 
rent interest to the specialist in this 
field. He then proceeds to a descrip- 
tive digest of studies classified under 
these headings: sensory discrimina- 
tion, basic drives, emotion, learning- 
memory-problem solving, biochemi- 
cal and neuroanatomical changes in 
mental disease. There is almost no 
interpretation or evaluation. 

A Diferential psychology and person- 

y. Four chapters fall under these 
general headings: individual differ- 
i (Anastasi), personality (Bron- 
Bene, abnormalities of be- 
(Cobb) bise) special disabilities 
2» Il four fields are concerned 
iii psychological differences among 

tual and classes of individuals. 
"x e stays within the psycho- 
LE 1c iramework in writing a well- 
Saanized review of the work on indi- 
y ^ differences. The discussion is 

r more on the interpretative 
n the evaluative side, except in 
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the parts concerned with the biologi- 
cal and social determinants of these 
differences. There she is critical of 
"hereditarian" inferences from in- 
adequately designed research. 

Bronfenbrenner's chapter is well 
written and it ranks high in the 
dimension of "'interpretation-evalua- 
tion." His introductory section 
states clearly his “biases” and his 
justification for them. Recognizing 
that “the attempt to deal... with 
multiple, interacting relationships 
rather than simple static entities of- 
ten leaves the theoretician with a sys- 
tem of ambiguous abstractions," he 
prefers the opportunities of the for- 
mer to the narrow constraints of the 
latter approach. 

White's chapter on abnormalities 
of behavior has a far more pro- 
nounced psychodynamic orientation 
than its predecessor by Zubin in 
Vol. 3. The latter stressed organic 
and genetic factors in his selection 
of references and in his general 
evaluative framework. White in- 
cludes such material—in particular 
a good discussion of biochemical in- 
vestigations of schizophrenia—but he 
emphasizes the influence of learning 
and life history upon abnormal be- 
havior. He concludes with the hope 
that “if we can arrive at more basic 
biological reaction patterns and also 
at more basic psychological reaction 
patterns these two classes of events 
will prove at last to be truly con- 
vergent.” In terms of its technical 
characteristics as a review, this chap- 
ter is one of the best in the book. 

Cobb defines a ‘‘special disability” 
as any defect or disability that may 
occur in an otherwise normally func- 
tioning person. Her bibliography is 
comparatively long, but its utility for 
readers is greatly reduced because it 
is not in alphabetical order. The 
material reviewed is a mixture of 
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basic and applied research on the 
etiology and psychological char- 
acteristics of visual, auditory, and 
speech defects, together with publica- 
tions on treatment and rehabilita- 
tion. The chapter is well written and 
is in general an excellent survey of a 
field not widely known to most psy- 
chologists. 

Social psychology. Newcomb opens 
his review with an interesting in- 
novation. By way of appraising 
efforts to define social psychology, he 
makes a comparative analysis of the 
content of six recent textbooks. His 
conclusion is that “social interaction” 
is the distinctive subject matter of 
social psychology, "that the term 
stands for something which can be 
studied at its own level." The phe- 
nomena of social interaction cannot 
be explained, he thinks, by ''mere 
extrapolation of general psychological 
principles.” 

Newcomb organizes his material 
under two main headings: attitudes 
and the processes of interaction and 
communication—without discussing 
the systematic interrelationships 
among the problems and concepts of 
the two areas. The presentations of 
research findings are exceptionally 
good, although there is less critical 
evaluation than in Smith's chapter in 
Vol. 3. 

Clinical psychology and counseling. 
Rotter sees the situation in psycho- 
diagnosis much as Magaret did in 
the preceding volume: “experimental 
investigation of test validity empha- 
sizes more and more the absence or 
inefficiency of prediction.” His re- 
view is well planned, and probably 
has as much structural unity as a 
good theoretical orientation can give 
to an appraisal of work on a multi- 
tude of incommensurable tests. There 
are three broad headings: (a) general 

contributions to the methodology and 
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theory of personality measurement; 
(b) clinical instruments; (c) research 
instruments. There is a good sum- 
mary at the end called ''Analysis of 
Trends.” Reviewers in other fields 
would do well to emulate this feature. 

Sanford's account of psychother- 
apy is not as systematic or compre- 
hensive as Raimy's chapter in Vol. 
3, although the chapter as a whole 
is a balanced interpretative appraisal 
in which clinical and scientific values 
are happily blended. A good report 
is given of the work in group therapy 
at the Tavistock Clinic in London, 
based upon personal observations 
over a period of several months. The 
longest section in the review is de- 
voted to research on psychotherapy, 
which Sanford regards as a fruitful 
means to the general investigation of 
personality. 

Williamson classifies the literature 
on counseling into (a) publications 
oriented towards psychotherapeutic 
theory and techniques, and (b) those 
focused upon “the choosing of oc- 
cupational goals based upon the diag- 
nosis of aptitudes and interests." He 
hasn't much to say about therapeutic 
studies, devoting most of the review 
to investigations of various diagnostic 
instruments. There is very lit 
interpretation or evaluation. 

Educational and industrial psychol- 
ogy. Superficially dissimilar, these 
two fields have many points of meth- 
odological and technological resem- 
blance. Both involve a broad range 
psychological principles and practices 
designed to improve the effectiveness 
of fundamental social functions. Bo 
draw upon all levels of psycholog! 
science—general, individual, and Lo 
cial. Perhaps for this reason, neither 
represents a unified scientific-tec?” 
nological speciality—as the two chap- 
ters in Vol. 4 will attest. 

For Carter, educational psychol- 
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ogy deals with school learning and its 
correlates. His review begins with a 
brief outline of the problems and 
conditions which further define the 
field. Such topics as readiness for 
learning, acquisition of desirable at- 
titudes, meaningful learning, socio- 
logical correlates of learning, emotion- 
al factors in learning, and measure- 
ment problems predominate. Very 
little work on the learning of specific 
school subjects per se is reported. 
And there is almost no relationship 
between this literature and that re- 
viewed by Underwood for the general 
psychology of learning. On the whole, 
this review is a good one, in marked 
contrast to that of Elmgren in Vol. 3. 

The lack of evaluation in the re- 
views of industrial psychology has 
already been noted. Harrell does, 
however, present a fairly extensive 
introduction in which over-all trends 
are effectively discussed. Major em- 
phasis in the body of the review goes 
to studies of human relations in in- 
dustry. As a rule these field studies 
appear to be inconclusive because of 
such factors as inadequate samples, 
uncontrolled conditions, and uncer- 
tain criteria. 

Statistical theory and research de- 
sign. Mosteller's chapter is consider- 
ably longer than its predecessors in 
earlier volumes, partly because it 
Covers a wider range of topics. Spe- 
cial features of the review are discus- 
sions of nonparametric statistics and 
ranking methods, which are being 
ee a reasingly by psychologists 
ie ata often fail to satisfy the 

umptions of the more conventional 
Parametric methods of analysis. 
Which, is an informative section in 
one poe dealing with the effects 

Shot 7 common parametric statis- 
pa ge from assumptions 
is “thet ‘he . The general conclusion 

epartures from assumptions 


may have little effect at some points 
of the distribution function, say near 
the mean or median, but far out in 
the tails of the distribution errors get 
to be large and the model may just 
no longer be appropriate.” He sug- 
gests that it would be useful to have 
empirical studies of the effect of 
violating the assumptions to varying 
degrees on this model. His attitude— 
not unusual in mathematical statisti- 
cians of empirical bent—seems to be 
that while illicit inferences are to be 
avoided, unnecessary scrapping of 
information is likewise to be mini- 
mized. Questions of the utility and 
limitations of various other mathe- 
matical "models" are discussed in a 
special section, including efforts to 
develop probability models for learn- 
ing data, uses of information theory, 
and difficulties in finding a suitable 
model for accident proneness. 
Theoretical psychology. This is 
Bergmann's term for the logic of 
psychology. Asa branch of the phi- 
losophy of science, theoretical psy- 
chology in this senseis concerned with 
the nature and structure of psycho- 
logical concepts, with the laws in 
which they occur, and the theories 
into which these laws combine. Fol- 
lowing this definition of the field, the 
author outlines the essential tenets of 
a “philosophy of psychology” on 
which he thinks virtual agreement 
has been reached, and devotes the 
remainder of the chapter to a critical 
analysis of deviations from tl;ese pre- 
cepts? First, certain dissents from 
this position in principle are noted: 
(a) the phenomenology of Snygg and 
Combs as regards the derivation of 
concepts; (b) concerning laws, Lon- 
don's antideterminism; (c) Skinner's 
attack upon theory. After brief refu- 
2 To designate this general methodological 


position, Bergmann uses the terms "neobe- 
haviorism" or “logical behaviorism.” 
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tations of these attacks, Bergmann 
gets down to the main business of the 
chapter which is the “prophysiologi- 
cal" movement in theoretical psy- 
chology. This development has two 
foci: (a) the Meehl-MacCorquodale- 
Feigl argument for the use of hypo- 
thetical constructs as distinct from 
intervening variables, and (b) neoges- 
taltism as represented in recent arti- 
cles by Krech. Although Bergmann's 


BOOK REVIEWS 


analysis may not convince his antag- 
onists, I think it is an outstanding 
contribution towards the clarification 
of theissues. It is perhaps unfortu- 
nate that he had to use so vulnerable 
a target as Krech for his attack upon 
gestaltism. But the gestaltists are 
mainly to blame; very few of them 
have ventured into the realm of for- 
mal methodological argument. 
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SHAFFER, G. Witson, & LAZARUS, 
RicHarp S. Fundamental concepts 
in clinical psychology. New York: 
McGraw-Hill, 1952. Pp. xi4-540. 
$6.00. 


This book could well have been two 
separate publications. The authors 
have divided their efforts so that one 
half the chapters are written by one 
and the remainder by the other. 
There is not much similarity in ap- 
proach or in style between the two 
sections. The first eight chapters are 
a sort of annotated bibliography 
covering the clinical field. The cover- 
age is quite broad but not especially 
deep. The last seven chapters are 
largely devoted to psychotherapy and 
related topics. 

The book should be useful in a 
number of ways, one of which will be 
as an aid to students preparing for 
comprehensive examinations. It in- 
cludes a great deal of information and 
it direc:s students to other excellent 
sources. 

It does not, however, live up to the 
promise of its title. Probably a 
representative group of clinical psy- 
chologists could not agree completely 
about which concepts in the field are 
really fundamental. But it is likely 
that most of them would agree that 
concepts such as psychological deter- 
minism, frustration-aggression, func- 


tional autonomy, dynamic mecha- 
nisms, and partial reinforcement, to 
name a few, are central to the field. 
None of these is given more than 
cursory attention. Although psy- 
choanalysis is discussed briefly in 
two different sections, the ego mecha- 
nisms are not included. The kindest 
thing to be said for the discussion of 
the Oedipus complex is that it is in- 
adequate. 

The book is written in the first per- 
son plural. The editorializing, which 
appears frequently, will be annoying 
to some readers and stimulating to 
others. The authors often criticize 
other clinicians for careless valida- 
tion, slipshod methodology, and 
cloudy thinking. On the other hand, 
many readers are likely to take ex- 
ception to their undocumented state- 
ments about the therapeutic process. 
In discussing therapy the authors 
stress the importance of such things 
as self-knowledge on the part of the 
therapist, the mastery of a variety 
of techniques, and special forms 0 
therapeutic training. Such state 
ments as “insight or uncovering 
therapy is, of course, the preferr 
treatment for most adjustive diffi- 
culties,” or “reassurance is neces 
sary in all therapeutic situations, 
or “the patient must uncover re- 
pressed material” will strike some 
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readers as dogmatic and unsupported 
by firm data. 

It is likely that there would be 
no general agreement among clini- 
cians that a book dealing with funda- 
mental concepts in the field should 
include five chapters on psychothera- 
py. There would probably be fewer 
votes for the necessity of a chapter 
on physical and chemical therapies. 
The latter chapter is quite detailed 
down to citing dosage of Mitragol and 
methods of applying electrodes. Much 
material in the earlier chapters is 
abbreviated or omitted on the plea 
of space limitation. Judgments will 
differ on questions of what should 
have been included and what left 
out. Many will feel that the book 
overemphasizes therapy at the ex- 
pense of other concepts. 

One final objection will occur to 
some readers. The authors con- 
stantly refer to persons with be- 
havior disorders or adjustive diffi- 
culties as "patients" who “suffer” 
from "illnesses." This seems to im- 
ply a fundamental concept about the 
nature of the clinician’s problem 
which should be further examined. 
Thinking about problems is frequent- 
W afected by the words used in their 
ed and many psychologists 

el strongly that by calling people 
ye emotional difficulties "sick" 
nere are created unnecessary seman- 
tic barriers to effective action. 

Era work that tries to cover as 
i & pond as this one it is easy 
A : noce and to disagree with 
E ors' choices of material for 
dan n. The authors have made 

s attempt to cover the field. 
the à estimates of the success of 
Ret ort his vary with their agree- 
topics with the writers’ choices of 

emphasized. 

MA GEORGE W. ALBEE. 

wersity of Helsinki. 
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KLEIN, MELANIE, HEIMANN, PAULA, 
Isaacs, Susan, & RIVIERE, JOAN. 
(Eds.) Developments in psycho- 
analysis. London: Hogarth Press 
and Institute of Psycho-analysis, 
1952. Pp. viii+368. 30s. 

The psychoanalytic dictum that 
early infancy is the crucial develop- 
mental period has been a source of 
as much frustration as stimulation 
among psychologists. While no psy- 
chologist these days seriously ques- 
tions the importance of this early 
period, neither is any psychologist 
particularly satisfied with the evi- 
dence for its importance or the meth- 
ods for its study. The retrospective 
reconstruction of infancy through 
adult analyses, and the description 
of babies’ feelings and attitudes by 
way of adult empathy, have always 
left psychologists dissatisfied and im- 
patient. On the other hand, there 
are no conventional, generally ac- 
cepted psychological methods for 
interpreting the emotional signifi- 
cance of infant behavior. Conse- 
quently, there are many theories, 
little data, and deep frustrations. 

The present collection of papers, in 
adding much to theory and little to 
data, may strain the frustration 
tolerance of the psychological reader. 
For the most part, the book aims to 
clarify Melanie Klein’s theoretical 
position concerning early infancy. As 
usual, her contributions are imagina- 
tive, provocative, and controversial. 
A fair evaluation of the writings, how- 
ever, requires criticism on two dif- 
ferent levels. 

One may begin by considering the 
theoretical contributions alone. Here 
there is little question that real ad- 
vance has been made in the sharpen- 
ing and extension of Klein’s concepts 
beyond the position she maintained 
in The Psychoanalysis of Children 
(1932). The emphasis on Freud's 
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concept of death instinct, and its 
use as the source of basic infantile 
anxiety, provides a starting point for 
the first of two phases of infant de- 
velopment: the  paranmoid-schizoid 
position. A major defense during this 
early period is splitting—basically 
the splitting between the “good” and 
"bad" breast, but reflecting as well 
a splitting or nonintegration of the 
ego. Itisonly as the ego becomes able 
to sustain anxiety that the infant can 
adopt the more mature depressive 
position, which itself fosters defenses 
leading to greater ego integration. 
The relationship of this developmen- 
tal sequence (greatly oversimplified 
by the reviewer) to the growth of 
introjection and projection represents 
a major contribution to psychoana- 
lytic theory. 

Not everybody will agree with this 
expansion of theory. Many of the 
points made by Klein have been for 
years a source of controversy be- 
tween one English analytic group and 
certain European analysts. Too of- 
ten, this controversy has ended with 
mild name-calling: who is the “genu- 
ine" psychoanalyst and who the im- 
postor? Frequently enough, it has 
involved real questions as to the ap- 
propriateness of an extension of cer- 
tain Freudian concepts to the earliest 
months of life. 

One cannot really evaluate theoret- 
ical contributions alone, however. 
Internal consistency is never enough. 
The more closely and carefully the 
detailed accounts of postulated in- 
fant development are written, the 
more they cry for empirical test. The 
test is not to be found in these papers, 
although a suggestion as to Klein's 
methods of arriving at hypotheses is. 
The instances which confirm Klein's 
hypotheses, as reported in one chap- 
ter, are taken from published British 

studies of infant development, from 
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her own clinical observations, and 
from casual, sometimes secondhand 
accounts of what babies do, As 
always, Melanie Klein in her observa- 
tions is the sensitive, intuitive invest | 
gator from whom few meanings of 
infant behavior are hidden. Riviere 
and Isaacs add notably to her ac- 
counts. But there is no consideration 
of alternative hypotheses, no account 
of the negative case, no hint as to 
the path from observed behavior to 
intuitive interpretation to theory con- 
struction. 

Even in the absence of these requi- 
sites of scientific method, the psy- 
chologist might be satisfied if the 
hypotheses advanced were testable. 
A careful survey of Klein's notions 
suggests that some probably are sub- 
ject to test, although these are the 
peripheral rather than the crucial 
hypotheses. If one cannot study the 
content of the unconscious fantasy 
of a three-month-old infant, for exam- 
ple, one still may be able to vary the 
frustration to which he is subjected 
and observe the typical defenses. If 
one cannot be sure of the infant's at 
titude toward the breast as part 
object, one may still examine the 
relative need-satisfying properties 
food and attention. Although one 
may not yet be able to relate weaning 
to persecutory anxiety in the infant's 
thoughts, one may nevertheless | 
to the spontaneous play of wean 
infants as an index of the significance 
of lost objects to the child. 

It is an encouraging sign that nu- 
merous isolated articles in the litera- 
ture of child development touch upon 
just such points as these. Today their 
number is increasing. Indeed, m 
of Klein's theorizing sounds less 
bizarre now than it did in 1932, large — 
ly because we are gradually accumu; ; 
lating from infants observation? 
material which has been ordered i? 
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nalytic terms. The lasting 
importance of these papers may well 
lie not so much in the theoretical 
controversies they foster as in the 
direction they give to the systematic 
observation of infants. 
ANN MAGARET GARNER. 
University of Illinois, 
College of Medicine. 


GREENE, Epwarp B. Measurements 
of human behavior. (Rev. Ed.) 
New York: Odyssey Press, 1952. 
Pp. xxiv+790. $4.75. 


Readers familiar with the first edi- 
tion of this book will be prepared for 
the comprehensive scope of the re- 
vision and its vast compilation of de- 
tailed and up-to-date information 
about measuring techniques of all 
varieties. They will be disappointed, 
however, in their expectations of a 
More tightly integrated book with a 
clearer and more precise exposition of 
basic principles and concepts of psy- 
chological measurement. 

The fault may lie in the nature of 
the task the author set for himself. 
What was difficult in 1941 has become 
Virtually impossible a decade later. 
Measurement and evaluation devices 
have proliferated rapidly in all areas 
of psychology and the development of 

eir systematic rationale has not 
kept pace with empiricalapplications. 
Any book which attempts to encom- 
pass practically all the techniques 
(and little of the underlying theory) 
Which have been or are employed in 
quantifying and assessing human be- 
desi must suffer from some super- 
p and lack of coherence. And 
rre covered in this one is for- 
Pis le: from psychophysical meth- 
dien the Szondi test, from Thur- 
i s attitude scaling to OSS evalu- 
Sii procedures and Kinsey's inter- 

ied methods. ‘Unfortunately, the 
is also exasperatingly crammed 
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with minutiae, often irrelevant. Even 
the subjects who posed for the photo- 
graphs of test administrations are 
graciously identified by name. 

The introductory chapters provide 
only a weak foundation for support- 
ing this mass of material. For exam- 
ple, the problem of the nature of 
measurement is dispatched in a para- 
graph; the procedures for developing 
measuring instruments are given in 
the form of brief exhortations such 
as the following: “decide specifically 
what is to be measured and how;... 
secure a large number of sample 
items..."; or, "analyze the re- 
sponses to each item to determine 
such attributes as content. ..." The 
other introductory chapters deal with 
test nomenclature (in which a curious 
distinction is made between achieve- 
ment, aptitude, and psychological [!] 
tests) and the characteristics of a 
“good instrument." Apart from a 
cursory discussion of reliability, 
norms, and the questionable attribute 
of test "uniqueness," the author 
never develops the general method- 
ological considerations essential to an 
understanding of test construction 
and accurate interpretation of test 
results. Such questions as the extent 
to which psychological tests satisfy 
the criteria for measurement, the 
logic underlying the concepts and 
empirical determination of test reli- 
ability and validity, the selection of 
the standardization group, etc. are 
disregarded or only incidentally 
treated. Nor are the concepts and 
procedures included in the three chap- 
ters on “elementary statistics” (one 
of which deals with factorial analysis) 
effectively applied to the problems 
involved in evaluating and using 
tests. In this connection, it is regret- 
table that the chapters in the earlier 
edition on “persistent problems" 
were deleted. 
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The remaining 500-odd pages are 
devoted primarily to a description of 
specific tests and related tools. There 
are seven chapters concerned with 
tests for early childhood, individual 
tests of ability, group tests of ability, 
motor and mechanical tests, tests of 
special aptitude, educational achieve- 
ment tests, and those developed for 
military personnel in World War II. 
In each of these chapters, representa- 
tive tests are described and illus- 
trated, scoring techniques explained, 
and some of the research findings on 
the use of the test surveyed. Many 
new and valuable tests developed 
since the appearance of the first 
edition have been included. There 
are, however, a few surprising omis- 
sions like the WISC and the Merrill- 
Palmer whereas some comparatively 
ancient tests of limited value have 
been retained. 

The chapters on attitude-, interest-, 
and personality-measurement have 
been greatly expanded. They are 
preceded by a distressing introduc- 
tion to dynamic theory and struc- 
ture of personality. The need for 
such a chapter is clear since many 
projective tests are discussed. But 
the oversimplified and, in places, 
muddled account points up the un- 
realistic design of the book. Clinical 
tests like the Bender-Gestalt, TAT, 
Rorschach, and Draw-a-Person are 
described, for example, quite fully. 
Such material, however, cannot serve 
in lieu of training manuals, nor is the 
methodology sufficiently explored to 
give the reader insight into the com- 
plex problems related to the valida- 
tion as well as the clinical and research 
use of these tools. 

Some errors of fact and naiveté are 
understandable in a survey text of 
this scope. Still, inaccuracies like the 
following are too frequent: “the word 
trait...is used to refer to any 
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physical aspect of a person . . . or to 
any mental aspect such as speed of 
reading... ”; ‘‘the sum of the verbal 
test scores [on the W-B] yields a 
verbal MA... '*; “this [the standard 
deviation] provides a method of scal- 
ing scores... comparable with the 
best physical scales . .. "; "the K 
score [on the MMPI] is the number 
of answers omitted because the client 
cannot say or will not choose." 

The breadth of content combined 
with the unsystematic approach lim- 
its the audience for the book. It can 
probably best serve as a reference 
text for advanced students who wish 
to become acquainted with a wider 
variety of tests than the standard 
laboratory and practicum courses are 
able to cover. The profusion of visual 
illustrations, the research bibliogra- 
phies, and the fairly complete classi- 
fied lists of available tests and in- 
ventories should be particularly help- 
ful for this purpose. 

EVELYN RASKIN. 

Brooklyn College. 


BAUMGARTEN, FRANZISKA. (Ed.) La 
psychotechnique dans le monde mo- 
derne. (Psychotechnics in the mod- 
ern world.) Paris: Presses Uni- 
versitaires de France, 1952. Pp. 
xi+630. 2,000 fr. 


Appropriately, these Proceedings 
of the Ninth International Congress 
of Applied Psychology were published 
in the series Bibliotheque Scientifique 
Internationale (Sciences Humaines 
Section Psychologie, with H. Piéron 
as the chief editor), designed princi- 
pally as means for acquainting thé 
French-speaking psychologists Wit 
important developments in the fiel 
of scientific psychology. In this series 
were published the translations © 
Rorschach's Psychodiagnostics (to- 
gether with the Bóchner-Halpern vor 
ume on clinical applications o if 
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Rorschach test); G. H. Thomson's 
treatiseon factorial analysis of human 
ability; and H. J. Eysenck's work on 
the dimensions of personality; and a 
large number of books by American 
authors (W. H. Sheldon's Varieties 
of Human Physique, two volumes by 
A. Gesell and F. Ilg, C. Wolff's 
Human Hand, as well as R. S. Wood- 
worth's Experimental Psychology, 
C. T. Morgan's Physiological Psychol- 
ogy, and D. Krech and R. S. Crutch- 
field's Social Psychology). Among the 
few volumes by French authors are 
J. M. Faverage's book on statistical 
methods and a book on early child de- 
velopment by O. Brunet and I. 
Lezine. 

There has been an interval of a 
long, eventful 15 years between the 
Eighth Congress of the International 
Psychotechnical Association (Prague, 
1934) and the Ninth Congress (Bern, 
1949). The editor stressed in her In- 
troduction that the Congress has 
offered to the participants an oppor- 
tunity to become acquainted with the 
present, dramatically altered features 
of psychotechnology. 

To provide a historical perspective, 
Dr. Baumgarten, as the secretary 
general of the Association, requested 
reports on developments in the field 
of applied psychology in different 
countries during the war years. The 
communications were published un- 
der the title Progress of Psychotech- 
mics (1939-1945) by A. Francke in 

ern, The continued interference of 
Political ideologies with the handi- 
work of the applied psychologist is 
ieee) in the fact that the Polish 

elegation, in a letter of May 5, 1950, 
Withdrew the manuscripts of papers 
Prepared for the Congress, which they 
Were unable to attend. 

popar as a wise policy that the 
iic ings have not been limited to 

racts of the papers nor do they 
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reflect with photographic accuracy 
what has transpired. The papers 
have been frequently shortened and 
regrouped. Among the outstanding 
recent trends in psychotechnology, 
Dr. Baumgarten noted the concern 
with social problems (including hu- 
man relations in industry) and the 
search for tests of personality. 
JOSEF BROŽEK. 
University of Minnesota. 


Lawsue, C. H. (Ed.) Psychology of 
industrial relations. New York: 
McGraw-Hill, 1953. Pp. vii+350. 
$5.50. 


The stated purpose of this book is 
“to present in a reasonably concise 
and non-technical form some of the 
content of industrial psychology 
which . . . might be useful to people 
who work with or manage other 
people.” The reader will find that the 
authors have been very successful 
in fulfilling this purpose. New ma- 
terial is presented in a style readily 
understandable by the supervisory 
force and by other management per- 
sonnel. An exhaustive list of refer- 
ences at the end of each chapter 
makes it possible for the reader to 
investigate any topic in greater de- 
tail. 

The rigorous scientific approach 
to problems typical of the Purdue 
groupisevident throughout the book, 
and the application of the experi- 
mental method to industrial rela- 
tions is documented with practical 
examples. 

One example of the critical atti- 
tude exercised by the authors in 
evaluating the studies cited can be 
seen in the chapter on employee su- 
pervision, in which the results of a 
nationwide survey of foremen are 
reported. The original article indi- 
cated that many of the foremen felt 
they were not posted on company 
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policies and that union shop stewards 
and others had usurped some of their 
rights. The authors of this book 
point out that their conclusions can- 
not be justified by the basic data and 
that only a minority of foremen held 
these unfavorable views. 

Three chapters of the book are de- 
voted to employee and group rela- 
tions. These chapters are outstand- 
ing and should prove most helpful to 
management personnel. The discus- 
sion of the informal group in the 
work force will enable foremen to 
understand the reasons for its exist- 
ence. When the supervisors realize 
that the informal group insures the 
individual personal recognition in a 
highly impersonal system and sup- 
plements the formal organization 
structure they will be better able to 
carry out their duties. 

Naturally, no book contains all the 
material the reader thinks should be 
included since there are practical 
limits of size which must be observed. 
Throughout this book a few chapters 
would have been strengthened by 
the addition of new material or a 
more thorough discussion of the ma- 
terial presented. The chapter on 
motivation and discontent in indus- 
try is somewhat oversimplified even 
for the nonpsychological reader. 

It is implied that the industrial 
psychologist is able to determine the 
motivation for an employee's behav- 
ior by examining the work and home 
environment. If the reader is not 
aware of the effort required and the 
limitations of the present state of the 
art, he may well expect overnight 
changes in the work force and sub- 
sequently be disappointed when they 
do not occur. 

In the chapter on employee coun- 
seling it is suggested that in some 
organizations the first-line supervisor 
will have to double as employee 
counselor. This is undoubtedly true, 
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but not enough attention is dev 
to the fact that it would be a diffi 
task for the supervisor to functig 
this dual capacity. The supposit 
that the supervisor can set aside } 
interests and activities for the 
he acts as the employee couns 
questionable. 
Unfortunately, a major failing. 
this book is the fact that the auth 
ignored union-management rel 
under the Taft-Hartley Act. Tl 
vision for an enforced ‘‘coolin 
period under this Act has ve 
nite implications for industrial 
tions. The worker, the union 
steward, and the foreman must 
"carry on" as usual during a pel 
when negotiations would 
reached a climax under other con 
tions. Worker-foreman relatio! 
should also have been conside: 
the prestrike and poststrike pel 
This book represents an eff 


previously existed. 
helpful to supervisory personn 
will enable many people who 
have specialized training in th 
to understand and use ind 
psychological techniques. It 
ther suggested that this book 
prove valuable as a second tex 
management training courses. 
Joszeu W. Wis 
Dunlap and Associates, Inc. 


POWDERMAKER, FLORENCE 
FRANK, JEROME D. Group 
therapy: studies in methodol 
research and therapy. Camb 
Harvard Univer. Press, 195 
xv+615. $6.50. 


The meat of this book lies » 
numerous detailed description 
ical events arising in an eX 
series of group therapy sessio 
neurotic and schizophrenic V 
events which have been matm 
reflected upon and used as à basis ™ 
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helpful generalizations about the 

and technique of group thera- 
py. The authors and their collabora- 
tors are sensitive observers, conscien- 
tious reporters, and sensible commen- 
tators. The trimmings of the book 
lie in the research data reported, 
which are nice to have but not nearly 
so nourishing as the analyses of group 
meetings and the excellent com- 
mentaries thereon. 

The report is based on a group ef- 
fort lasting over a period of two 
years. Some 120 neurotic patients of 
a mental hygiene clinic and some 170 
chronic schizophrenics in a psychi- 
atric hospital were treated by 19 
different psychiatrists working with 
a total of 27 groups. A research team 
of eleven persons, including psychia- 
trists, psychologists, and social work- 
ers, observed and recorded the meet- 
ings and then worked together to 
make communicable sense of the re- 
sults of their experiences. An average 
of five patients attended the clinic 
groups, The groups of psychotics 
were more than twice as large, a 
circumstance reported as favoring 
therapy with these severely disturbed 
people. One group, and a fascinating 
one, was the entire population of a 
ward, about 85 patients. Most of 
the therapists were inexperienced in 
Working with groups at the start of 
the project, and though they seem to 
share confidence in catharsis and in- 
terpretation as sources of gain in 
Jerapy, their newness to the group 
Situation ensured considerable vari- 
mw in approach. Groups varied 
n size and in rationale of composi- 
EN some groups were open, others 
is te closed; duration of therapy 

aried from 8 to 175 meetings. In 
Ede group most patients re- 
ilr dues therapy concur- 
de ii uch sources of variability 
as a endid for generating hypothe- 

ut impose tangible limitations 
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on their verification. An observer 
sat with each group and was responsi- 
ble in collaboration with the thera- 
pist for recording what went on. 
Tape recorders were used as an aid to 
accuracy but not as a source of ver- 
batim transcripts. The accounts of 
sessions and of critical events were 
made more graphic and complete by 
descriptions of postures, facial ex- 
pressions, and side plays, and more 
pointed by the omission of material 
considered irrelevant. But at the 
same time, students of therapy who 
have come to find verbatim tran- 
scripts indispensable will find them— 
indispensable. 

Though weak research-wise, the 
book is packed, 600 pages full, with 
discerning observations about the dy- 
namics of interpersonal relationships 
in therapy and with good, practical 
suggestions for making group therapy 
work. At this stage of our knowledge 
of group therapy, the problem is not 
so much to validate its effectiveness, 
which the authors essayed with 
limited success, as it is to define the 
process and to describe rigorously 
just what goes on. This latter need 
the book meets admirably, for one 
approach to therapy and for a par- 
ticular population. It will have much 
immediate value for therapists and 
will be a rich source of hypotheses for 
more circumscribed and definitive 
studies. 

NicHoLAs HOBBS. 

Peabody College. 


GELLHORN, ERNST. Physiological 
foundations of neurology and psy- 
chiatry. Minneapolis: Univer. of 
Minnesota Press, 1953. Pp. xiii 
+556. $8.50. 


With the rediscovery of Freud and 
psychoanalysis during the thirties by 
organized American psychiatry, pop- 
ular interest swung away from the 
organic etiology of mental disorder 
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and tended to concentrate itself upon 
"dynamic," developmental concepts 
and explanations. This break with 
tradition was a healthy one, but as is 
so often the unfortunate consequent 
of such extreme pendular movements 
within a discipline, there was not 
only an overemphasis upon func- 
tional factors, but a real neglect and 
almost a degradation of the physi- 
ological approach. It has seemed to 
this reviewer, however, that the last 
few years have seen the beginnings 
of a corrective movement with the 
pendulum swinging back toward in- 
terest in the physiological, and some 
hope for a more healthy balance be- 
tween functional and organic influ- 
ences in our study and interpretation 
of psychopathosis. 

Professor Gellhorn's book is there- 
fore a very timely one. Its explicit 
purpose is to survey and evaluate the 
links which can be established be- 
tween experimental neurophysiology 
and clinical neurology and psychia- 
try. It begins with a survey of the 
basic factors regulating neuronal ac- 
tivity, proceeds through a discussion 
of the physiology and pathology of 
movement, the physiology of con- 
sciousness, and the physiology and 
pharmacology of the autonomic nerv- 
ous system to a series of stimulating 
and provocative integrative chapters 
on neuro-endocrine action, the physi- 
ological basis of emotion, condition- 
ing, homeostasis, and the phenomena 
of "constancy." The closing section 
on applications, relates the material 
to the clinical problems of the psy- 
choses and psychoneuroses, and shock 
and carbon dioxide therapy. 

This is admittedly a huge canvas, 
and no single author could be ex- 
pected to do the picture complete 
justice. — "Justice," however, is a 
relative term, and in this reviewer's 
opinion, Gellhorn has done an ex- 
cellent e There are some omis- 
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sions, and it is obvious that Gellhorn 
writes with greater detail and greater 
enthusiasm where his own experi- 
mental work is concerned. He has 
not hesitated to take sides on con- 
troversial issues and occasionally he 
has made specific suggestions for 
treatment. This is illustrated by the 
chapter on the restitution of move- 
ment aíter central lesions where he 
pummels the general concept of the 
“plasticity” of neural function, and 
offers suggested lines to be followed 
in re-education. The net result of his 
partisan attitude, however, does not 
interfere with the fairness or scope 
of his survey, and does add a flavor 
and a definite liveliness to his treat- 
ment. Considering the difficulty of 
the subject, the book is well written 
and reads relatively easily. Gellhorn 
has succeeded in his goal of demon- 
strating “the fruitfulness of the physi- 
ological method for the study of 
pathological phenomena and the re- 
wards for physiology itself of this 
type of research." Á 

For psychologists the book is time- 
ly and important. In an age which 
is marked by its absorption in func- 
tional explanatory concepts, a devo- 
tion to higher order statistical ab- 
stractions, and a resulting tendency 
to intellectualize and verbalize the 
problems of human behavior, Gell- 
horn presents fundamental physi- 
ological data whose explanatory pow” 
ers and predictive potential cannot be 
neglected. The book holds both à 
promise and a warning for contem- 
porary psychology and cannot 
overlooked. 

WiLLIAM A. HUNT. 
Northwestern University. 


Szonp1, L. Experimental diagnostics 
of drives. (Translated by Gertr" 
Aull.) New York: Grune 2% 
Stratton, 1952. Pp. x+220. $13.50. 


For two reasons the English trans- 
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lation of this volume, first published 
in German in 1946, is something of an 
anticlimax. Most psychologists in 
this country are now generally fa- 
miliar with Szondi's test and unusual 
theories through the work of his stu- 
dent and co-worker, Susan Deri. 
Furthermore, research results on the 
Szondi Test appearing in recent years 
are generally unfavorable. There is 


little need to touch on the test itself 


in this review despite the fact that 
the book contains the basic manual 
and, presumably, the validating evi- 
dence for the test as a psychodiag- 
nostic instrument. Borstelmann and 
W. Klopfer’s excellent review and 
critical evaluation of the research on 
the test appeared in the March 1953 
issue of this journal. Although Susan 
Deri claims that the test is not 
dependent upon the genetic theories 
of its author, Szondi apparently be- 
lieves that data obtained from the 
test results verify his theories. 

In Experimental Diagnostics of 
Drives Szondi presents a highly 
speculative theory of personality 
which, he states, is derived from a 
union of genetics and depth psychol- 
ogy (psychoanalysis) validated by 
the results of “more than 4000 ex- 
periments.” The several thousand 
experiments evidently refer to that 
many individual administrations of 
the Szondi Test to an unreported 
number of subjects from an unspeci- 
fied sample of the “general popula- 
tion,” and to undescribed criterion 
groups of abnormal subjects. His 
Preferred technique for citing evi- 
‘Sd consists in anecdotal descrip- 

Ons of single cases whose test pro- 
Ws ciis Seem to coincide exactly 
The t € principle under discussion. 
bie E also contains a number of 
5 Dna tables, but two of these 
Bias eled relative frequencies (as- 

),” and the others are related 
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in some undisclosed fashion to the 
“4000 experiments.” Sample N’s, 
means, and dispersions are not given. 

This book is definitely not con- 
cerned with research on motivation, 
at least as we know it. The inclusion 
of the word "experimental" in the 
title and in many chapter headings is 
misleading. Thus the reader is forced 
to assess the personality theory on 
the basis of its reasonableness and 
internal consistency. To the Ameri- 
can psychologist, the theory's rea- 
sonableness is immediately suspect 
from Szondi's preface which declaims 
that this "new approach serves as an 
independent means of psychodiag- 
nosis in the service of psychopathol- 
ogy, vocational psychology, psychol- 
ogy of delinquency, pedagogy and 
characterology.... ^ According to 
the working hypothesis repressed 
latent genes in the lineal (inherited) 
unconscious determine the choice in 
love, friendship, profession, sickness 
and death (sic)." 

The role of dominant manifest 
genes in this psychology of pre- 
destination is not mentioned. Moti- 
vation, he claims, is based on eight 
specific drive needs (factors) derived 
as arbitrary polarities from four in- 
dependent hereditary syndromes. He 
implies that the syndromes are ac- 
cepted by geneticists interested in 
mental disorder. As there is no 
qualitative difference in motivation 
between the normal and the ab- 
normal, the drive needs must be 
universals. 

The book may contain some in- 
tuitive, basic insights into human 
personality, but, as far as can be 
judged, Szondi's insights are largely 
Freudian constructions translated in- 
to a new and less reasonable frame- 
work. 

Victor C. RAMY. 

University of Colorado. 
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It has always been apparent that 
^, human beings learn a great deal dur- 
_ ing ‘infancy and childhood and that 
4 the résults.of this learning often con- 
__ tinue to affect behavior throughout 
_ the remainder of the life span. Long 
before the days of a science of be- 
1 ee naturalists speculated about 
"the possible: importance of early 
learning in the lives of lower animals. 
More than two thousand years ago 
Aristotle wrote: 
. Of little birds, some sing a different note 
from the parent birds, if they have been re- 
moved from the nest and have heard other 
5 singing; and a mother nightingale has 
observed to give lessons in singing to a 
me bird, from which spectacle we might 
viously infer that the song of the bird was 
not congenital with mere voice, but was some- 


thing capable of modification i 
and improve- 
ment (5, p. 121). x^ 


During the two thousand years after 
» Aristotle other writers compiled anec- 
| otes and life histories of various ani- 
mal species, but not until the emer- 
m of comparative psychology in 
; e closing years of the nineteenth 
entury did the subject of behavioral 
B epmeat attract serious scientific 
» est. Among the earliest Ameri- 

n workers to deal with this problem 
i d who in 1899 published an 
d icle on the albino rat as a useful 
fio E un psychological investiga- 
rM 9). Small's account included 
. Section dealing with behavior in 

ancy and adolescence. Eight years 
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EFFECTS OF EARLY EXPERIENCE UPON THE 
, BEHAVIOR OF ANIMALS 


FRANK A. BEACH AND JULIAN JAYNES 
. Yale’ University 


later, Yerkes’ book, The Dancing 
Mouse (132) appeared, and that 
treatise dealt among other things 
with ontogenetic aspects of behavior. 
In recent years the amount of at- 
tention paid to this subject has in- 
creased significantly, and the change 
is attributable to several different 
sources of stimulation. Some of the 
first systematic experiments dealing 
with behavioral development were 
generated by the problem of the rela- 
tive importance of “maturation” and 
“practice” in the perfection of simple 
response patterns, such as the peck- 
ing behavior of young chicks (117) or 
the swimming behavior of larval am- 
phibians (16, 47). Additional impetus 
was given to investigations in this 
field by the impact of Freudian 
theory (38), which led to various 
studies, such as those which limited 
food supply or feeding responses in 
young animals and then measured 
the effects upon certain types of be- 
havior in adulthood. A third influ- 
ence, slower to gain recognition in 
America, has derived from the ob- 
servations and theories of Konrad 
Lorenz and other European biologists 
who examined the effects of early 
social stimulation upon the adult be- 
havior of various species of birds. 
The most recent increase of inter- . 
est in the effects of early life exper- 
iences upon the behavior of adult 
animals is traceable to theories that 
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stress the importance of perceptual 
learning in infancy upon subsequent 
performance in tests of learning. A 
leader in this field is Hebb, whose 
The Organization of Behavior (52) has 
been directly or indirectly respon- 
sible for a number of experiments re- 
ported in psychological journals dur- 
ing the last two or three years. 

Because of the rapidly growing in- 
terest in this area of comparative 
psychology, we have assembled and 
evaluated as much as possible of the 
relevant evidence. This seems worth 
doing not only because of its useful- 
ness to students of animal behavior, 
but also because scientists interested 
primarily or exclusively in human 
psychology very frequently turn to 
the literature dealing with lower ani- 
mals in connection with discussions 
of human development. The area 
covered is a diverse one; it touches 
and illuminates many fields of scien- 
tific investigation; its findings bear 
ona wide variety of problems ranging 
from plant parasitism to human per- 
sonality; and its significance stretches 
from the genesis of sensory expe- 
rience to the theory of evolution. 
Since the resulting diversity is very 
extensive, we have found it most 
feasible to organize our review in 
terms of the types of behavior which 
investigators have measured after 
modifying the animal’s early expe- 
riences. 

As the reader will discover, we 
have given the term “experience” a 
very broad meaning and used it to 
include a variety of factors ranging 
from the type of food consumed by 
larval insects to the characteristics of 
visual stimulation experienced by in- 
fant chimpanzees. This plan has been 
followed because of our conviction 
that there is no clear-cut line of divi- 
sion between the effects of early "psy- 
chological” experience and certain 
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physiological antecedents of adult be- 
havior. The development of behavior 
can be examined from either the psy- 
chological or the physiological view- 
point, but this is an arbitrary, meth- 
odological distinction made purely for 
the sake of convenience. Any analy- 
sis which is confined to one of these 
alternatives is bound to be incom- 
plete and to'a greater or lesser degree 
misleading. - 


SENSORY DISCRIMINATION 
AND PERCEPTION 


Descriptions of the sensory and 
perceptual abilities of animals have 
very often implied the unwritten 
suggestion that these abilities are 
exclusively determined by innately 
organized neural mechanisms. Sev- 
eral recent studies tested this as- 
sumption by studying the effect of 
early sensory experience upon 
sensory abilities of the adult. 

Vision. Visually directed responses 
of laboratory rats are not modified to 
any marked degree if the animals are 
prevented from using their eyes dur- 
ing infancy. Animals of this species 
which have been reared in darkness 
proportionally adjust the force of 
jumping to a food platform at differ- 
ent distances, although during the 
first trials the accuracy of such jumps 
is inferior to that of normally rea 
animals (75). Rats reared in dark- 
ness are capable of size, brightness, 
and pattern discrimination after very 
little visual experience (49, 50), a? 
correspond to normal animals 1n 
their ability to generalize from one 
visual stimulus to another on tf 
basis of relative size or relative 
brightness (50). 

Investigations of the importance 
of early visual stimulation in ot 
subprimate vertebrates have shown 
more pronounced effects, and sugges 
the existence of marked differences 


n species. It is reported, for 
ple, that when à rabbit which 
been reared in darkness was first 
ght into the light, it assumed ab- 
al bodily postures and bumped 
obstacles as it moved about the 
environment (40). The eyeblink re- 
flex in the dark-reared rabbit was 
normal, but the latency of pupillary 
nse to light was retarded; it was 
ive seconds in duration. After one 
'month under normal illumination the 
visually controlled movements of this 
"animal had improved to normal. 
- This result was not attributed to 
anatomical degeneration, since exam- 
ination of the retinas and optic tracts 
"ef other rabbits reared in darkness 
revealed no structural abnormali- 
ties. Recent work, however, contra- 
dicts this earlier conclusion. Chemi- 
cal analysis of the ganglion cell layers 
of retinas from rabbits reared for the 
first ten weeks in total darkness 
showed a complete absence of the 
pentose nucleoproteins present in 
normally reared animals (13). The 
cause is presumed to be the lack of 
adequate visual stimulation during 
the postnatal period of development. 
Itis therefore possible that the visual 
deficit found in dark-reared animals 
is attributable to incomplete chem- 
ical development of the ganglion 
cells in the retina. 
Pigeons whose eyelids had been 
sewn together before they had opened 
haved abnormally when the lids 
E" opened six weeks later (88). 
ike the rabbit reared in darkness, 
these birds tended to assume abnor- 
) Mal postures and apparently were 
unable to avoid obstacles on the 
E of visual cues. Furthermore, 
d did not show the usual startle 
E when a hand or handker- 
ef was passed rapidly before their 
yes. Some reflexes, including pupil- 
ty responses to light, were unaf- 
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fected by the lack of previous visual 
stimulation, but optokinetic nystag- 
mus was impaired. In general, pos- 
ture, obstacle avoidance, and visual 
responses improved rapidly and were 
normal three days after the lids had 
been freed. 

A similar, partial visual depriva- 
tion was produced by another experi- 
menter (118), who encased the heads 
of ringdoves with translucent vinyl 
hoods when they were three days old, 
before their eyes had opened. This 
technique permitted diffuse uniform 
light to reach the retina, but pre- 
vented form perception. Two months 
later, some opacity was found in the 
vitreous humor of the eyes, but con- 
trary to the previously cited study, 
there were no differences in the opto- 
kinetic reactions of the hood-reared 
doves and normally reared birds. 
No observations on obstacle avoid- 
ance or posture are reported, but 
when first unhooded and forced to 
fly, the ringdoves did so erratically 
and with definite errors in space 
orientation. For purposes of some 
future experiments, the experimenter 
cut windows in the hoods of the 
birds when they were two months 
old, so that only the lower half of the 
eye could be used, and then tested the 
birds on a form-discrimination prob- 
lem. It was found that the hood- 
reared birds took significantly longer 
to learn a circle versus triangle dis- 
crimination than similarly tested 
controls, the conclusion being that 
early experience is a prerequisite to 
normal adult function in visual per- 
ception. 

The deleterious effects of prolonged 
visual deprivation are not limited to 
mammals and birds. Eyed fishes 
taken into light from lightless subter- 
ranean cave pools repeatedly bump 
into obstacles when swimming about 
in a strange aquarium. They fail to 
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seize food as it drifts downward 
through the water, but instead seem 
to rely on chemical sensations as they 
grub about in the sand covering the 
tank floor. Visual control of swim- 
ming and feeding appears gradually 
and is only complete after several 
weeks under normal illumination. 
Fish that were reared in the lab- 
oratory in lightproof tanks for two 
and one-half years behaved like those 
captured in underground lakes and 
did not show normal visual responses 
until they had spent about one month 
in the light (14). 

Most primates depend heavily 
upon visual cues, and it is not sur- 
prising that lack of visual stimula- 
tion in early life has pronounced ef- 
fects upon subsequent behavior. 
Chimpanzees that were kept in the 
dark for the first 16 months of life 
were comparable to the pigeons de- 
scribed above in that they showed a 
marked pupillary response to in- 
crease in light intensity, but opto- 
kinetic reflexes were impaired and 
there was no blink when an object 
suddenly approached the eye (105). 
These apes exhibited head and eye 
movement to follow a point of light 
in a darkened room, but steady 
fixation and obstacle avoidance on 
the basis of visual cues were totally 
absent. In one case approximately 50 
hours of visual experience elapsed be- 
fore the first signs of visual learning 
(reaching for the nursing bottle) ap- 
peared (105). These results may have 
been due to physiological changes 
resulting from a lack of visual stimu- 
lation, since ophthalmoscopic exam- 
inations at a later date revealed evi- 
dence of severe retinal and optic 
nerve degeneration (107). 

In a second experiment using 
chimpanzees, one animal was reared 
in total darkness to the age of seven 
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` to visual cues when such cues Ars 


months. A second was reared ii 
dark but exposed to diffuse light) 
image formation possible) for 
and one-half hours per day. A 
ape was taken from the dark ea 
and allowed to spend one and øl 
half hours in a normally illuminat 
room (106). Testsbegun when theani 
mals were seven months old sho 
that the individual with full visu 
perience for 90 minutes per day 
haved like a normal chimpanzee, M 
the partially deprived ape ( ec 
only to diffuse light), the blink re: 
sponse to objects approaching 
appeared after six days of visual 
rience, while in the totally depri 
ape (no light during the first parti 
theexperiment), the blink reaction 
not appear until 15 days. Steady fi 
ation appeared in 13 days in the par 
tially deprived, and 30 days in th 
tally deprived animal. Lea ing 
avoid a conspicuous visual stimu 
which was associated with ele 
shock took one or two trials in om 
for normal apes, 13 days of two tn 
per day for the partially deprived i 
dividual, and 15 days of two tria 
day for the chimpanzee reared i 
tal darkness. Ophthalmoscopi 
amination revealed evidence of 
nal degeneration in the totally 
prived animal, but no ocular à 
malities were found in the other twe 
animals (107). 1 
The evidence is fragmentary * 
at points inconclusive, but it d 
suggest that in at least some Spe 
of fishes, birds, lower mamm 
primates absence of the no 
amount of visual stimulation d 
the developmental period may 
in inability to respond adapt 


comeavailable to the individu 
defects do not appear to be 
nent but seem to disappear W 
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creasing experience in visually di- 
rected responses. 

Touch and proprioceplion. There 
is some reason to believe that some 
other types of sensory deprivation in 
infancy may affect subsequent be- 


havior. One anecdotal account de- 
scribes a young chimpanzee whose 
paws were badly injured in infancy 
and which, as a consequence, never 


indulged in the frequent climbing 
behavior typical of this species. It is 
reported that after its paws had 
healed, this animal rarely exhibited 
any tendency to climb and showed 
signs of dizziness whenever it got 
more than a few feet above the 
ground (53). 

A more recent experiment involved 
bandaging or encasing an infant 
chimpanzee's lower arms and legs in 
cardboard tubes and thus preventing 
the normal manipulative use of 
hands and feet (92). The restraining 
devices were in place, on the single 
subject, for 30 months, beginning 
with the fourth week of life. When 
the limbs were freed, the ape showed 
some forced grasping, which nor- 
mally disappears during the first 
postnatal month. Walking and sitting 
postures were abnormal. Grooming 

havior, so typical of the normal 
chimpanzee, was lacking, as was the 
almost universal tendency to cling to 
the human attendant when picked up. 

hen a normal chimpanzee is 
touched lightly at some point on the 

y, the usual response consists of 
vnd bringing one or both hands 
2" € locus of stimulation. In the 
E oua animal these responses 
M te either lacking or abnormally 

Ow and inaccurate. Though visual 
p ination in this animal was 
» pue ape had great difficulty 

earning to differentiate between 
tile stimulation of the right and 
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left hands when vision was excluded. 


FEEDING BEHAVIOR 


Pecking responses in birds. As 
noted in our introduction, the con- 
troversy regarding the relative im- 
portance of maturation and expe- 
rience in the shaping of behavior led 
to several studies of pecking behavior 
in chicks (12, 25, 87, 95, 112). These 
have been reviewed so often that they 
need not all be discussed here (see 
80). In general, it appears that 
birds which are prevented from peck- 
ing by being reared in darkness, and 
then tested for the accuracy of such 
behavior, tend to be somewhat infe- 
rior to normal controls at first, but to 
improve rapidly and equal fairly soon 
the performance of chicks that have 
not been deprived. The most inter- 
esting report from the point of view 
of the present discussion is the state- 
ment that dark-reared chicks, fed 
artificially from a spoon for two weeks 
after hatching and then placed in the 
light, never developed the pecking 
response. Put ina pen with an ample 
supply of grains and grit to peck at, 
“they all starved to death even in the 
midst of plenty” (95, p. 436). 

As noted earlier in this review, 
some animals that have been reared 
in darkness are incapable of normal 
visuomotor behavior. It may be 
that part of the failure of dark-reared 
chicks to peck normally is due to 
sensory or perceptual deficiencies 
rather than to delayed motor devel- 
opment alone. Preventing the peck- 
ing response for long periods after 
birth has not been found to cause the 
disappearance of that response in 
other birds. Ringdoves, for example, 
were artificially fed from three days 
of age and prevented from pecking 
by being reared with translucent 
hoods over their heads (118). After 
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as long as two months, the hoods 
were removed, and the birds devel- 
oped coordinated pecking at grain 
within 15 hours. 

The visual stimuli capable of evok- 
ing the pecking response can be modi- 
fied by early feeding experience. One 
group of chicks was reared for seven 
weeks in a cage which was illuminated 
exclusively from above, while in the 
cage of a second group the only light 
came through the translucent floor. 
Later the response to photographs of 
grains of corn was tested. In some 
pictures the corn was illuminated 
from above, in others from below. 
The chickens reacted selectively, 
pecking only at those photographs in 
which the lighting conditions corre- 
sponded to those under which real 
feeding had taken place (55). 

A different approach to problems 
of pecking activity has been sug- 
gested by Levy's theory that there 
exists a pecking “need” or "instinct," 
and that birds continue to peck until 
this need is satisfied or satiated. To 
test this hypothesis, one large group 
of hens was raised in a cage having 
an elevated floor of wire mesh, while 
a control group was kept on a dirt 
floor (77). Chickens kept on wire 
pecked at their cage mates until 
many birds were partially denuded 
of feathers. Comparable behavior 
was rare among the control birds, 
which could peck the ground. When 
transferred to ground cages at six 
weeks of age, and mixed with control 
cases, the chickens reared on wire 
vigorously pecked at the backs of the 
fully feathered controls, but this be- 
havior died out after two months. 
Another group of chicks was reared 
on wire floors for a longer period of 
time. It was found that the pecking 
of companions was progressively re- 
duced and ceased after four months, 
but when transferred to the ground 
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at seven months of age these birds 
again began to peck one another as 
vigorously as before. 

Nursing behavior in mammals. 
Levy has suggested a comparable 
theory for the sucking responses of 
mammals. It is proposed that under 


normal conditions, a "ncc1" to suck ` 


is satisfied as the infan' sucks the 
mother's teat or breast, but if the 
need is not satiated during nursing, 
nonnutritional sucking behavior will 
appear. The occurrence of chronic 
thumb sucking in some bottle-fed 
babies and the reported ear licking 
and tail sucking of bucket-fed calves 
have been interpreted in this way 
(76). In an attempt to test the hy- 
pothesis, four puppies were separated 
from the bitch at 10 days of age and 
fed from nursing bottles thereafter 
(76). Two dogs drank from nipples 
with very small holes so that to ob- 
tain their milk ration they sucked an 
average of 80 minutes a day. The 
other two animals were fed from 
large-holed nipples and consumed 
the milk supply in an average of 13 
minutes of nursing per day. 

It is reported that the rapidly fed 
puppies exhibited sucking movements 
after feeding time and made sucking 
noises while asleep. In addition these 
dogs tended to suck their own bodies 
and those of other animals as well as 
to lick inanimate objects in the ken- 
nel. None of these characteristics ap- 
peared in the two animals that had 
the longer periods of nursing during 
infancy. A more recent study has 
investigated the possibility of pro- 
ducing this effect through sucking 
deprivation at an earlier age (109). 
Three puppies were partially de- 
prived of sucking experience but ade- 
quately fed for the first 10 days of life 
and then returned to the mother 
During the deprivation period the € 
perimental animals showed a stronger 
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sucking pressure and more nonnutri- 
tional sucking than normal controls. 
These differences disappeared, how- 
ever, when the dogs were returned to 
the bitch, and there was no persist- 
ence of the compensatory sucking 
such as had been reported in the 
earlier experiment. 

Hoarding behavior in mammals. 
Laboratory rats do not normally 
hoard their food, as do many other 
rodents, but they can be induced to 
do so under appropriate conditions. 
The evidence is somewhat contra- 
dictory, but the results of some inves- 
tigations indicate that early expe- 
rience with food may have some effect 
upon hoarding in adulthood. Accord- 
ing to the first study of this problem, 
rats reared on powdered food hoard 
fewer food pellets than other animals 
that have always eaten pellets; and 
subjecting rats to semistarvation in 
prepuberal life increases their tend- 
ency to hoard food in adulthood, even 
though the animals are satiated at 
the time of testing (131). Results of 
à subsequent experiment suggested 
that partial starvation for 15 days 
immediately after weaning produces 
increased hoarding in adulthood, but 
only if the hoarding tests are pre- 
ceded by a period of food deprivation 
(59). In this same investigation it 
Was found that partial starvation 
later in life, beginning 12 days after 
Weaning, has no effect on food hoard- 
ing in adulthood. Attempts to re- 
Produce these results in later experi- 
ments showed hoarding differences 
Mieneen rats starved in infancy and 

ntrols, but the differences were not 
as statistically reliable as those found 
Previously (60). 

r more recent attack upon this 
: eral problem involved study of the 
E of infantile deprivation on the 
- ell ing of water-soaked cotton as 
às food (79). Albino rats of both 
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sexes were divided into three groups. 
For 15 daysafter weaning, one group 
was partially deprived of food, and a 
second group of water. A control 
group was maintained on ad libitum 
food and water until adulthood. At 
130 days of age, all animals were 
placed in special cages where hoard- 
ing tests were conducted, both before 
and aíter a period of deprivation. 
The food-deprived males were found 
to hoard significantly more food pel- 
lets than controls after adult depriva- 
tion, but no significant differences 
between deprived and control ani- 
mals were found among the females, 
or in the water-hoarding tests for 
either sex. The picture presented by 
such results is of a more complicated 
problem than was at first supposed. 

Positive results in studies of this 
type may be due to the prior acquisi- 
tion of pellet-seizing and pellet-carry- 
ing responses, as well as to an in- 
creased rate of eating in the experi- 
mental animals (82). This hypothesis 
has been tested, and it was found 
that rats deprived severely in in- 
fancy had a faster eating rate as 
adults (83). Presumably, such rats 
would need less time to eat in the 
hoarding tests and would therefore 
have more time to hoard extra pellets. 

The positive reports of increased 
hoarding consequent to infant feed- 
ing frustration have been rather 
widely cited as bearing upon psy- 
chiatric theory (57, 104). In view 
of the lack of clear-cut confirmation 
by subsequent experimenters, final 
judgment probably should be re- 
served. 


REPRODUCTION 


Among the most species-specific 
behavior patterns in any particular 
species are those having to do with re- 
production—courtship, fertilization, 
nest building, parturition, and the 
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care of the young. For a long time, 
these often unique stimulus-response 
organizations were classified as "in- 
stincts" and were thought to be ex- 
clusively determined by genetic con- 
stitution. | Evidence which shows 
that the occurrence of these responses 
is heavily dependent upon other 
factors, especially the early expe- 
rience of the animal, has recently been 
gathered from many widely divergent 
invertebrate and vertebrate species. 
It is worth stressing at this point that 
early experience appears to influence 
the kinds of stimuli that will later 
evoke reproductive activities, where- 
as the actual responses are less sub- 
ject to modification. 

Egg-laying behavior in insects. Sev- 
eral studies have shown that the 
particular chemical stimuli which 
call forth the egg-laying responses of 
gravid females of several insect spe- 
cies are determined in part by the 
type of food which the insect has 
eaten or been exposed to during the 
larval stage of life. The female saw- 
fly normally deposits her eggs on the 
leaf of a particular species of willow 
tree, but if forced to do so she will 
oviposit on a different kind of willow. 
In an early experiment, successive 
generations of females were forced to 
deposit their eggs on an unpreferred 
species, and, after four years, the 
descendents showed a preference for 
the new type of host plant (46). 
However, the mortality in this study 
was high, and the results could pos- 
sibly be explained on the basis of 
genetic selection. 

More carefully controlled studies 
have employed a choice method in 
which the insect’s tendency to ap- 
proach different hosts is tested in a 
Y-shaped tube. Thorpe and Jones 
(128) found that all adult females of 
the ichneumon fly are attracted to the 
odor of meal-moth larvae, which are 


FRANK A. BEACH AND JULIAN JAYNES 


the normal hosts. But insects that 
had been reared on wax-moth larvae 
during their own larval period were 
also attracted to the odor of larval 
wax moths, while insects reared on 
normal hosts were not. Immediately 
after they had hatched, normally 
reared females were brought into 
contact with meal-moth larvae, and 
as a result of this exposure they 
showed temporary attraction to the 
immature meal moths; but the at- 
traction was lost if the ichneumon 
females were isolated for 10 days 
(123). Similar changes in adult pref- 
erences produced by varying the 
type of food consumed in the larval 
stage have been reported for fruit flies 
(124) and vinegar flies (26). 

Thorpe (124) regards this type of 
modification of behavior in insects as 
a conditioned response and compar- 
able to imprinting phenomena in 
birds, which will be discussed later 
in this review. Whatever the expla- 
nation, these experiments do demon- 
strate: (a) that adult olfactory prefer- 
ences in some invertebrates can 
affected by early environment, and 
(b) that this type of effect can in turn 
influence egg-laying behavior in these 
particular species. It is important to 
add that this conditioning effect does 
not occur in all insects. When fe- 
males of the Colorado beetle were 
reared as larvae on a foreign type 
host plant, they did not oviposit on 
the new plant (McIndoo, as cited in 
Thorpe, 124), and descendents of a 
strain of chalcid wasps which had 
been confined exclusively to a foreign 
host for 260 generations still pref 
the ancestral host (111). k 

Sexual selection in fish and birds. 
The reproductive behavior of some 
fishes and of many birds is character" 
ized by the comparative importance 
of elaborate patterns of display a" 
courtship which are usually an indis- 


le preamble to the mating act 
/ The overt responses involved 
appear to be modified by prior 
jence, but in some cases the 
4 i that elicit display and court- 
have been found to be markedly 
lent upon certain kinds of 
‘arly experience. 

Although adult male cichlid fishes 
apparently show courtship and mat- 
"ing responses only toward members 
of the opposite sex, males that were 
initially reared in isolation attempted 
| ‘to mate indiscriminately with both 

males and females (93). 
T Passenger pigeons reared by foster 
Tingdove parents are said to have 
failed to mate with their own species 
: ms adults, but to have attempted 
Sexual union only with birds of the 
same species as their foster parents 
(23). Other ornithologists have noted 
Similar interspecies "sexual fixa- 
tions.” Turtledoves, domestic and 
| Wood pigeons, and magpies have been 
reported to pair with individuals of 
Other species if they are reared with 
that species (41). Unfortunately, the 
of experimental controls renders 
the interpretation of such observa- 
tions difficult, but the results suggest 
that in some species of birds, individ- 
Wals tend to mate with the species 
With which they have been associated 

| in infancy. 

Comparable effects upon adult 
Mating preferences are sometimes 
nd in birds that have been reared 
by human beings. Three male, blond 
Tingdoves were reared singly by Craig 
(24). When they were first put into 
& cage with a female dove, these 
Males directed their courtship and 
Soital responses to the hand of their 
Caretaker or to inanimate objects in 
Me cage. Eventually, normal mating 
» een the birds occurred. It has 
= en reported that a domestic turkey 

Cock, which had been reared by hu- 


e. 
> 
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man males, responded to men with 
the usual courtship and mating activ- 
ities, whereas women elicited the 
fighting and retreating reactions nor- 
mally reserved for rivals of the same 
sex (101). This cock never displayed 
to turkey hens, but would tread them 
or any similarly sized object if he 
were forcibly placed in the mating 
position. 

Mating behavior in mammals. No 
interspecies mating preferences due 
to special conditions of rearing have 
been reported for mammals, but 
other effects of early experience on 
sexual activity have been described. 
One early study of male rats led to 
the conclusion that prepuberal segre- 
gation of males increases the tend- 
ency of an animal to cross an electric 
grid when a male is on the opposite 
side and decreases the tendency to 
cross to a female (65). However, the 
interpretation of results obtained 
with the obstruction method is diffi- 
cult, and the unsystematic procedure 
of this experiment places its results 
in serious doubt. Other studies (7) do 
not support the suggestion that pref- 
erence for a masculine sex partner is 
influenced by the conditions of rear- 
ing. Rearing rats in isolation has no 
effect on the pattern of the copula- 
tory response itself either in males (7) 
or in females (122), but it does tend 
to increase the proportion of males 
that copulate in comparison with 
other groups that have been reared 
with females or with other males (7). 

An effect of prepuberal experience 
on adult sexual activity in the male 
rat has recently been reported (66). 
Male rats were reared in individual 
cages from infancy, but periodically 
throughout the prepuberal life phase 
they were exposed to age mates of 
either sex in standardized ten-minute 
tests. When tested with sexually re- 
ceptive females in adulthood, these 
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males displayed some copulatory re- 
sponses, but also indulged in a great 
deal of the wrestling and play be- 
havior that is characteristic of the 
immature rat. Ejaculation in the 
male rat normally follows a series of 
closely spaced intromissions, but in 
the experimental males, the fre- 
quency of intromission was greatly 
reduced by the interpolation of play 
reactions, and ejaculatory responses 
were therefore infrequent. 

Parental responses in mammals. 
Care of the young depends in some 
instances on experience gained before 
sexual maturity. Early experiments 
revealed that primiparous rats, con- 
fined in individual cages up to the 
time of parturition, made use of avail- 
able nesting material and cleaned and 
cared for their young in a biologically 
adequate fashion although they had 
never seen nests built or delivered 
litters (6). It appears, however, that 
nest building is in some fashion re- 
lated to earlier experience in the ma- 
nipulation of objects in the environ- 
ment. Femalerats that were reared in 
cages containing nothing which could 
be picked up or transported are re- 
ported to have built no nests at par- 
turition, even though the appropriate 
type of materials was available (108). 

In an as yet unpublished study, 
female rats were reared with rubber 
"Elizabethan ruffs" around their 
necks (Birch as cited in Riess, 108). 
This technique prevented the female 
from washing herself or from hav- 
ing any mouth contact with her own 
genitalia. The ruffs were removed 
shortly before parturition, and it is 
reported that the experimental fe- 
males failed to clean or to nurse their 
young. 

Effects of captivity or domestication 
upon reproductive behavior. The ef- 
fects of rearing animals in captivity 

sometimes include the appearance of 
sexual maturity at an earlier age than 
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has been observed under natural con- 
ditions, as well as extension of the 
breeding periods in adulthood. This 
Occurs in a variety of mammalian 
species including the ibex, wild boar, 
American bison, antelope, chamois, 
and others (53). In one controlled 
study, the offspring of wild grey rats 
(Norwegicus) were bred for 25 gen- 
erations in the laboratory. In the 
course of this time, the length of fer- 
tility in relation to the life span was 
doubled, although average litter size 
did not change (71). Some mammals, 
however, such as the polar bear 
(120), show no alteration in the 
length of the breeding season under 
captivity, and failure of normal sex- 
ual development has been reported 
in three gorillas reared in captivity 
since infancy, and also in the cheetah 
and Malayan bear (53). Whether 
these effects have some physical 
cause, dietary or otherwise, or are due 
to early experience is not known. 


GREGARIOUS AND FILIAL 
BEHAVIOR 


Some species of animals maintain 
colonies, schools, flocks, or herds 
throughout the year. In other spe- 
cies, formation of social groups 1S 
limited to the breeding season or to 
the time during which the mother and 
young must remain in close associa- 
tion. In both instances, study of the 
influence of early experience on social 
behavior has yielded important re- 
sults. 

Species recognition in insects and 
fish. Among invertebrates, such ag- 
gregations as have been studied seem 
to depend on some single specific 
stimulus. For example, each colony 
of ants, even within the same species, 
has a unique odor. Members of à 
colony recognize each other by means 
of this cue, and an ant contaminate 
with the odor of a different colony !$ 
usually attacked and killed (9). 13 
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possible that this olfactory basis of 
social organization is partly deter- 
mined by early experience, as is host 
preference in some parasitic insects. 
If normally antagonistic species of 
ants are placed together within 12 
hours after emerging from the pupae, 
there is no aggressive behavior be- 
tween them, and a permanent mixed 
colony can be established (32). The 
effects of this early "conditioning" 
are apparently permanent, for tem- 
porary separation of a mixed colony of 
two species after a month of associa- 
tion did not decrease their compati- 
bility. Even after seven months of 
separation, ants that were originally 
reared in a mixed colony did not at- 
tack those of the same odor with 
which they had been reared, though 
allother "foreign" ants were promptly 
attacked and killed (33). 

Visual stimuli are commonly in- 
volved in the social interaction of 
several kinds of fishes, and in some 
species responsiveness to a given 
visual stimulus is influenced by early 
experience. Adult jewel fish of one 
type show a blue-black color of the 
ventral surface during the brooding 
period. The fry of this species nor- 
mally have a positive response to 
blue-black objects and are unrespon- 
Sive tored. But young fish that have 
been reared by foster parents with 
brilliant red underbellies show a re- 
duced response to blue-black, and a 
Positive reaction to red (94). In 
other words, the sensory basis for 
filial grouping in these fishes is partly 
dependent on very early experience. 

Gregariousness in insects. The 
color, morphology, and aggregation 

ehavior of solitary and migratory 
locusts differ quite markedly, and 
at one time they were classified as 
Separate species. It is now known 
that these striking differences are 
phases" of the same species (129), 
and that the differentiating charac- 
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teristics are directly related to certain 
conditions existing during the insects’ 
development. In an early experiment 
it was found that progeny of the 
solitary locust could be transformed 
into the migratory type by rearing 
them under crowded conditions, 
whereas individuals reared in isola- 
tion never become migratory (31). 
This relationship of early environ- 
mental conditions and adult behavior 
in the locust has been confirmed in 
field observations (70, 130). 

A well-designed experimental anal- 
ysis of social behavior in locusts has 
recently been performed and further 
strengthens the above conclusions 
(28). Groups of locusts in the first 
instar (i.e., from hatching to the first 
metamorphosis) were reared under 
crowded conditions, and others were 
reared in individual jars. When the 
experimental animals had metamor- 
phosed, and were therefore in the 
second instar, aggregative tendencies 
were measured by placing groups of 
them in a ring-shaped cage, the floor 
of which was marked off in equal-size 
segments. At intervals after the be- 
ginning of the test, the numbers of 
insects in each cage segment were 
counted. By comparing the fre- 
quency distribution of segments con- 
taining various numbers of individ- 
uals with a Poisson series it was de- 
termined that locusts which had been 
reared in groups aggregated at a 
level well above chance, whereas in- 
sects reared in isolation were signifi- 
cantly less gregarious. 

When the formerly isolated locusts 
were kept under crowded conditions 
for 24 hours, there was a consider- 
able increase in gregarious behavior 
as measured by the same test, and 
after four days of group living, their 
social behavior was identical with 
animals reared in groups since hatch- 
ing. In contrast, the social behavior 
of locusts reared for the first instar 
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in crowded conditions was not ap- 
preciably affected by subsequent iso- 
lation. This suggests that in this 
species the tendency to aggregate, 
once it has been acquired in infancy, 
is irreversible. 

Gregariousness in birds. The ef- 
fects of early experience on social be- 
havior seem to be much more com- 
plex and extensive in most birds than 
in fishes. Lloyd Morgan observed 
that domestic chicks reared in an 
incubator are relatively indifferent 
to the hen and her flock (86), and 
fecent experiments indicate that 
hand-reared chicks respond very 
slowly, if at all, to the food-call cluck- 
ing of the hen (20). Mallard and 
Muscovy ducklings reared in an in- 
cubator do not show the normal se- 
lective response to adults of their own 
species. Qualitative measures of the 
tendency of isolated chicks to move 
toward another chick in a runway 


revealed that the approach was much 


slower than normal, with frequent 
pecking at the second bird. When 
they were placed with the flock, such 
chicks tended to keep apart from the 
group (19). These antisocial effects 
are maximized by rearing chicks in 
complete isolation away from any 
animal or human being. Chicks so 
reared are said to have developed a 
persistent and "'rage-like" restless- 
ness as well as unusual stereotyped 
behavior patterns, and to have re- 
acted in a "fearful" fashion when 
placed with the flock (Bruckner, as 
cited in Katz, 68). 

There is some reason to doubt that 
rearing chicks in isolation completely 
abolishes all positive responses to 
others of their kind. Chicks have 
been reared in isolation to the fourth 
day after hatching, and then tested 
for their choice between a stimulus 
compartment containing chicks and 
one containing mice. They spent 
more time in front of the former (97). 
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Interestingly enough, birds that had 
been raised in the flock showed no 
such preference. 

“Imprinting” in birds. In studying 
reproductive behavior of birds, we 
noted several instances in which the 
choice of sex object by the adult bird 
was influenced by the species with 
which the individual had been asso- 
ciated in infancy. That other social 
responses can become ‘‘fixated” in 
this way, especially toward human 
beings, has long been known. It is 
said to be true, for example, of the 
rhea bird (58). If eggs of the Euro- 
pean greylag goose are hatched in an 
incubator and the young are exposed 
to a human being just after hatching, 
the young birds respond in filial 
fashion to the human but not to 
adults of their own species (Heinroth 
as cited in Lorenz, 78). It is further 
claimed that incubator-hatched geese 
can be reared by mother geese only 
if they are placed in a sack imme- 
diately upon removal from the in- 
cubator, so that premature exposure 
to humans is prevented. To describe 
the process responsible for these phe- 
nomena, the German ornithologist, 
Konrad Lorenz, has coined the term 
Prägung, translated as “imprinting.” 
Imprinting takes place when the 
first stimulus to evoke a given "in- 
stinctive" reaction becomes, with 
extreme rapidity, the only stimulus 
that henceforth can evoke that re- 
action. According to Lorenz, im- 
printing has two properties which dis- 
tinguish it from other types of learn- 
ing. First, it can only take place dur- 
ing a specific and brief portion of the 
life span while the organism is in à 
critical physiological stage of devel- 
opment; and second, the results of 
imprinting are irreversible through- 
out the life of the individual. 

Lorenz confirmed Heinroth's ob- 
servations on the greylag goose 
Newly hatched goslings were exp 


— 
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oand allowed to follow the experi- 
inter and were prevented from see- 
ng, hearing, or being in contact with 
adult geese until the tendency to 
follow the human foster parent had 
heen firmly established. Thereafter, 
they tended to follow and remain 
ie to Lorenz and showed no posi- 
tive social responses to adult geese 
178). There is evidence to show that 
imprinting occurs in some other birds, 
; in those ground-nesting 
ies whose young are capable of 


is they are hatched. Young chickens, 
ducks, or turkeys have been reared 
"by foster parents belonging to one of 
the two other species and under such 
nditions have been found to form 
ss-species filial attachments (102). 
— A number of independent studies 
‘combine to indicate that susceptibil- 
to imprinting is a function of the 
age of the young bird. The coot is 
most easily affected by imprinting 
during the first 8 hours after hatching, 
and coots that are 24 hours old can- 
not be imprinted (1). The most 
thorough study of filial imprinting 
— has been made on ducklings (29). 
The young birds (tufted ducks, eid- 
€ts, and shovellers) were hatched in 
c An incubator under opaque hoods. 
__ At various durations after hatching, 
L thehoods were removed, and the duck- 
lings were allowed to follow the re- 
— treating experimenter. It was found 
that for all three species the follow- 
‘ing-reaction could be evoked, and 
imprinting established most easily in 
the first 6 hours and, as in the coot, 
Could not be obtained if 24 hours had 
“lapsed since hatching. 
‘ E^ all birds followed the experi- 
E without his also making audi- 
q oy Signals. Such sounds increased 
inc probability of evoking the follow- 
een and at least in the tufted 
| wi , the shoveller, and the mallard 
emed to have a stronger releasing 
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effect than visual stimuli. Birds that 
had been imprinted subsequently 
followed the experimenter whether 
he walked, swam, or rowed a boat, 
and paid no attention to adult ducks. 
After three weeks, the following-re- 
action to humans gradually dimin- 
ished. 

In tree-nesting birds, whose young 
have a slower behavioral develop- 
ment, the formation of social attach- 
ments takes a somewhat different 
course. The initial phase of imprint- 
ing to a parental object is believed to 
occur as it does in ground-nesting 
birds, but the results do not become 
irreversible until some time later. For 
example, jackdaws taken from the 
nest at 14 days of age were found to 
be imprinted on their natural parents, 
but after separation from adult jack- 
daws and continued association with 
humans from this time on, the birds' 
positive social responses could be 
shifted to human beings (78). After 
20 days, when the young jackdaws 
are fledged, such shifts in social re- 
sponsiveness cannot occur. Those 
that have been with adult birds can- 
not be imprinted to humans, and 
those that have earlier been im- 
printed to humans cannot be induced 
to react normally to their own kind. 
These preliminary findings on one 
insessorial species are of interest, 
but they require a more thorough 
conceptual and experimental analysis 
before they can be generally accepted. 

There are some avian species in 
which parental imprinting to humans 
does not occur. For example, some 
wading birds, such as curlews and 
great godwits, show the escape reac- 
tion to man immediately after hatch- 
ing in an incubator, and hence im- 
printing on man is impossible (78). 

It is interesting to consider the 
possible role of imprinting in the 
evolution of species. It seems highly 
probable that it operates as a SOCH 
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and reproductive isolating mecha- 
nism in the case of sympatric species. 
That it may also play a role in foster- 
ing the geographical isolation of some 
allopatric species should be of inter- 
est to ecologists. Thorpe has pointed 
out that the significance of imprint- 
ing may not be confined to the for- 
mation of bonds with other living 
creatures, but may apply also to 
bonds with inanimate aspects of the 
environment (126). Evidence for this 
is scant, but it is known that acoustic 
signals (29) or boxes and footballs 
(102, 127) can be used as imprintable 
stimuli for various birds. Validation 
of this hypothesis might aid in under- 
standing the formation of ecological 
races. For example, peregrine falcons 
nest in cliffs in one region and in trees 
in another, but in periods of over- 
population, the cliff-district birds un- 
adaptively attempt to nest in un- 
suitable cliffs rather than switch to 
trees like falcons of other districts 
(56). This may be due to imprinting 
of the young falcons to cliff nests. 
Song acquisition in birds. The songs 
of birds are an important mechanism 
in their social organization, especially 
in the breeding season, and it is 
widely believed that in some cases 
these songs are influenced by audi- 
tory experience acquired early in life. 
For some species, such as the rhea 
bird (100) and roller canary (85) no 
such influence is found, but if young 
common canaries are allowed to 
hear the recorded song of the night- 
ingale they develop a song interme- 
diate between the normal canary and 
nightingale songs (61). In another 
study,  bluebirds, robins, wood- 
thrushes, orioles, bluejays, and red- 
winged blackbirds were reared to- 
gether in a large room. At maturity, 
none of these birds sang the normal 
song of its species (115). Similar 
modifications of the adult song result- 
ing from conditions of rearing have 


been reported for other species in- 
cluding bobolinks, grosbeaks (116), 
sparrows (21), whitethroats, tree 
pipits, meadow pipits, greenfinches, 
chaffinches, and yellow buntings (54). 

Song acquisition in birds has been 
considered as a special type of im- 
printing (127). This interpretation 
is supported by the report that some 
birds, like the chaffinch, never ac- 
quire the normal song if they have 
developed an abnormal one in early 
life (43). That this also may be a 
case of an isolating mechanism di- 
rectly related to early experience has 
been suggested by Mayr (84) in his 
discussion of the song races of chaf- 
finches about Stuttgart. A male chaf- 
finch, who had by some chance as- 
sociation with other birds learned a 
different song in infancy, would be 
expected to employ this song in court- 
ship. If he were successfully mated, 
the progeny would then be exposed 
to the special song, acquire it, and 
use it in their courtship later, and in 
this manner a new species or sub- 
species might develop. Thus it is 
seen that in birds early experience 
can exert a marked influence upon 
the future social life of the individual, 
and also perhaps upon the future of 
the species. 

Social behavior in mammals. Stud- 
ies of the social effects of early expe- 
rience in mammals are relatively few. 
Naturalistic observations reveal that 
lambs of wild sheep show something 
like the imprinting process in birds. 
When captured and reared by hu- 
mans, they do not later rejoin the 
herd, but follow and remain with 
human beings (89). Domestic lambs 
that have been reared on a bottle 
exhibit similar behavior. When 
turned to pasture with the flock, 
they show difficulties in adjusting to 
other sheep and tend to graze by 
themselves (112). 

Scott and Marston (113, 114) have 


£ 
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T outlined a theory to the effect that 
the development of young puppies is 
divisible into several sequential peri- 
ods, certain of which are "critical" 
in the sense that during these rela- 
tively brief. phases the young dog 
forms habits or develops tendencies 
which have profound effects upon 
social behavior in later life. 

There is no experimental evidence 
to establish the point, but it may be 
noted that the parturitional behavior 
of certain wild ungulates is such as to 
allow maximal opportunity for new- 
born young to form social bonds to 
the mother within a very short time 
after birth. Female red deer and 
American elk associate in herds 
throughout most of the year, but just 
before calving, the pregnant female 
leaves the group and seeks seclusion. 
For several days after parturition 
the young remain isolated and hid- 
den, and are visited periodically by 
the mother. Then both the female 
and her offspring rejoin the herd, and 
thereafter the calf's following and 
suckling responses are directed solely 
to its own mother, while she in turn 
permits only her own young to re- 
main close to her and to nurse (2, 
27). The possibility suggests itself 
that during the brief period when the 
calf’s social contact is restricted to 
its mother, imprinting or some similar 
process takes place. 

The chimpanzee is a highly social 
animal and may be considerably dis- 
turbed by temporary isolation (73, 
91). The object of its social impulses 
appears to be influenced by expe- 
hence in infancy, just as it is in many 
birds and in some subprimate mam- 
mals. Several observers (48, 63, 69) 
Who have reared infant chimpanzees 
in their homes have noted how easily 
at this age the social reactions of 
a animals can be transferred to 
4 man adults. One such chimpanzee 

as later returned to the laboratory 
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and caged with others of its own spe- 
cies. For several months he did not 
mix with other apes, but looked 
“longingly” at human beings who 
were visible to him beyond the cage 
(Finch, as cited in Nissen, 91). 


EMOTION AND TEMPERAMENT 


Wildness and timidity. To the 
human observer, animals in nature 
display characteristic behavioral 
tendencies that are often described 
as wildness, timidity, or aggressive- 
ness. Some of the laboratory studies 
cited in this review indicate that cer- 
tain emotional responses in animals 
are often directly related to expe- 
riences acquired early in life. In birds, 
we have seen how rearing in isolation 
produces unusual emotional behavior 
(Bruchner, as cited in Katz, 68), 
fearfulness when placed with the 
flock, and a tendency to peck at other 
birds (19). Another study involved 
rearing domestic chicks in small 
wooden boxes inside the brooder 
where other chicks could be heard 
and their shadows seen. Even this 
partial isolation resulted in fighting 
behavior when each of these birds 
was paired with a normal chick at 
seven weeks of age. Fear reactions 
and “bewilderment” when placed 
with the flock were also reported (15). 

“Wildness” involves both genetic 
and experiential factors. Among the 
latter, one significant natural vari- 
able for some species seems to be the 
temperament of the mother, Black 
ducks and mallard ducks are equally 
wild in nature, but the mallard’s 
wildness is readily modifiable when 
the birds are hatched and reared by 
tame foster parents (98). Wild 
herring gulls that have been reared 
by the somewhat tamer common 
tern do not display the usual herring 
gull’s timidity toward man (Kuhle- 
man, as cited in Thorpe, 127). 

The tameness of the mother seems 
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to be an important factor in the wild- 
ness of some small mammals. Rats 
reared by tame females are more doc- 
ile toward humans than those reared 
by wild mothers (103), though the 
taming effect is said to be cancelled 
out if a wild father remains in the 
same cage with the family (110). In 
another study it was found that de- 
scendents of wild gray rats reared in 
captivity for 25 generations lost their 
“high nervous tension and fear of 
man,” although in this instance 
there was no control of the possible 
effects of selective breeding in suc- 
cessive generations (71). One com- 
monly recognized early-experience 
factor affecting wildness is associa- 
tion with human beings. Lion cubs 
taken from the mother and reared by 
hand are tamer than those nursed by 
the lioness (22). 

It has been suggested that severe 
trauma in infancy can affect emo- 
tional reactivity in later life. Young 
mice from a strain that is highly sus- 
ceptible to audiogenic seizures when 
stimulated with the sound of an elec- 
tric bell were exposed to this stimu- 
lus from the fourth day to the seventh 
day of age. No seizures resulted in 
these infant animals. However, 
when they were placed in the same 
situation at a later age, but not given 
an auditory stimulus, the “trauma- 
tized" mice displayed a higher fre- 
quency of eliminative responses than 
did litter-mate controls (44). These 
findings were originally interpreted 
as indicating a general increase in 
"emotional instability," but the test 
for emotionality was made in the 
same environmental setting where 
traumatization had previously been 
attempted. It is therefore impossible 
to rule out the alternative explana- 
tion that the results were due to di- 
rect and specific conditioning. In a 
similar investigation rats were used 
as subjects. Infant animals were ex- 


posed to a variety of supposedly 
traumatic stimuli and tested for 
emotionality in adulthood. No dif- 
ference was found between experi- 
mental and control groups in an open 
field test or in susceptibility to audio- 
genic seizures (42). 

In a study of emotionality in dogs, 
three Scottish terriers were raised as 
pets with a great deal of human 
handling, and another three were 
reared in restricted laboratory cages 
with a minimum of human contact. 
As adults, the cage-reared animals 
differed from the others in that they 
displayed "freezing" behavior in an 
unfamiliar environment, hugging the 
floor and staring forward, as well as 
a behavioral "anesthesia" to a hypo- 
dermic needle (18). 

Aggressiveness. As noted earlier, 
chicks reared in isolation tend to 
peck other chicks more than do 
group-reared birds, and the fighting 
behavior of adult mammals of some 
species is markedly influenced by 
certain types of infant experience. 
Hungry infant mice were trained to 
fight over food shortly after weaning. 
As adults, these animals fought over 
food even when not hungry. Litter- 
mates, raised without competitive ex- 
perience, did not show this behavior 
except under conditions of starvation 
(36). 

Early defeats in a fighting situa- 
tion have been shown to influence ag- 
gressiveness in later life. Mice of dif- 
ferent ages were exposed to and at- 
tacked by a second mouse that had 
been previously trained to fight. The 
results showed that the younger the 
mouse was at the time of its initial 
defeat, the greater was the persevera- 
tion of nonaggressive, withdrawal 
responses in a fighting situation in 
adulthood (67). In experiments of 
this type, it is difficult to conclude 
whether the results are due to the age 
variable per se, or to the possibility 
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that a young animal would expe- 
rience a greater amount of trauma 
than an older animal. 


LEARNING 


Many psychologists appear to con- 
ceive of learning ability in animals as 
genetically determined and relatively 
unmodifiable, but recent findings in- 
dicate the untenability of this thesis. 
Harlow (45) and his co-workers have 
demonstrated the development of 
“learning sets" in primates, wherein 
an animal “learns how to learn," and 
other studies have shown that stimu- 
lus-response learning in rats is sig- 
nificantly facilitated when they have 
previously learned quite different 
responses to the training stimuli (64, 
74). 

Results of a second group of ex- 
periments indicate that an animal's 
performance on special "intelligence 
tests" is directly related to certain 
kinds of experience in infancy. In one 
way or another, most of these experi- 
ments are outgrowths of a theory pro- 
posed by Hebb (52). In brief, this 
theory predicts that animals that 
have had a large amount of percep- 
tual experience early in life will prove 

tter learners than others deprived 
of Such experience. Further, it is pre- 
dicted that the magnitude of this fa- 
Cilitative effect is, within limits, in- 
versely related to the age at which 
the Perceptual experience is gained. 
Eu for this view has been 
ape largely by varying the en- 
ED in which the animal ma- 
p ats reared from 21 to 51 days 
o ee in a small room and not in 
Pte a found to be superior in 

sm earning to other animals that 
E reared in small cages or in 
"onem boxes" (11). In a more 
i en y controlled study, one group 
ton. as PM in a large cage, and 
feared S of a second group were 

individually in mesh cages 
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which were periodically moved from 
one position to another in this large 
cage. A third group was kept in en- 
closed activity wheels, and a fourth 
group in enclosed stove-pipe cages. 
Rats in the first two groups per- 
formed significantly better on the 
Hebb-Williams maze than did the 
latter two groups. It was concluded 
that the visual perception in a spa- 
cious environment during early life 
facilitates adult learning ability (62). 
A further study confirmed the pre- 
ceding results and added the fact that 
having early access to pieces of ap- 
paratus somewhat similar to parts of 
the Hebb-Williams maze further in- 
creased learning scores on that par- 
ticular test (35). 

Rats reared as pets in the home 
also show definite superiority in cer- 
tain learning situations when com- 
pared with animals reared in lab- 
oratory cages (51). In this connec- 
tion it should be noted that the mere 
handling of rats from infancy im- 
proves performance on a T maze in 
adulthood (8), and it is possible that 
the difference between the home- 
reared and laboratory-reared animals 
might be due to this factor. 

The age at which the extended en- 
vironment is experienced is an im- 
portant factor. Rats reared for a 
period of time early in life in a large 
space are clearly superior on the 
Hebb-Williams maze to animals that 
are first kept in small cages and then 
put into the more favorable environ- 
ment at a later age (62). Supporting 
the same conclusion in another way is 
the fact that rats blinded early in 
life are inferior in maze performance 
to rats blinded at a later age (51, 62). 

One possible explanation for these 
differences may be that rats reared in 
a more spacious and heterogeneous 
environment develop a superior abil- 
ity to use distance orientation cues in 
learning. This hypothesis is in accord . 


256 


with the finding that being raised in 
a large cage does not improve the 
rat's ability to learn a simple alley 
maze (62), or a visual discrimination 
problem (11). Presumably distance 
cues are less important in such tests 
than in more complex measures of the 
Hebb-Williams type. Positive evi- 
dence for the hypothesis is seen in the 
fact that rotation of the Hebb-Wil- 
liams maze interferes much more 
seriously with the performance of rats 
reared in large than those reared in 
small cages (35, 62). 

Exploratory tendencies may also 
be involved in the differences under 
consideration. One of the first studies 
of this problem showed that the tend- 
ency of rats to explore a novel en- 
vironment in adulthood is increased 
if the animals have been reared in 
large instead of small cages (96). 

A few experiments have been con- 
ducted to extend some of these find- 
ings to other species. For example, 
dogs reared as household pets are de- 
scribed as superior in learning ability 
to others raised in laboratory cages 
(18). However, it is impossible to 
estimate the degree to which the 
marked emotional differences be- 
tween the two groups of dogs, as dis- 
cussed earlier in this review, may 
have contributed to differences in 
test performance. 

A quite different aspect of infant 
experience that affects an animal's 
learning ability in adulthood is the 
amount of food available. Rats de- 
prived of food in infancy, and thus 
morphologically stunted, show higher 
activity scores as adults, and take 
less time to learn a maze for a food 
reward than do normally reared ani- 
mals (3, 4). This finding holds true 
for mice as well (72). A later study 
showed that these results may have 
been due to the fact that the physio- 
logical condition of inanition con- 
tinued to the time of learning in the 
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experimental animals, rather than 
to other effects of early experience 
(10). A perceptual factor may also be 
involved. Rats which have been de- 
prived of food and water in infancy 
surpass normal animals in learning 
the locations of food and water in a 
maze even though they are satiated 
at the time of testing (17). 

The studies of Harlow (45) on 
primate learning have shown how the 
learning performance of monkeys 
and chimpanzees may be increased 
by prolonged experience with various 
aspects of the problems presented. 
According to Hebb’s theory, if this 
type of experience were gained in 
infancy it would have an even more 
marked effect on adult learning ca- 
pacity, but very few such experi- 
ments have been conducted with 
primates. In one study a home-reared 
chimpanzee was compared on four 
simple problems with a laboratory- 
reared chimpanzee, and with four 
human children the same age as the 
animal. The home-reared ape and 
the children did equally well but the 
laboratory chimpanzee could only 
solve one out of the four problems. 
However, since the solution requir 
imitation of the human experimenter, 
a requisite which the laboratory- 
reared animal had had no opportunity 
to acquire, the interpretation of these 
results is ambiguous (48). 


DISCUSSION 


Reviewing the studies in the fore- 
going pages has left the authors with 
two basic convictions. The first 18 
that a fuller understanding of the ef- 
fects of early experience upon subse- 
quent behavior is fundamentally 
important for a science of psychology 
and for the broader field of animal 
biology. A second and equally strong 
impression is that much if not most 0 
the presently available evidence bear- 
ing upon this problem is equivoca 
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and of undetermined reliability. 
Many of the studies cited in this 
review are primarily of a naturalistic 
character and some of them verge on 
the anecdotal. Approximately 10 per 
cent of the reports contain conclu- 
sions based upon the behavior of a 
single individual. Many of the re- 
cent laboratory investigations reveal 
weaknesses in experimental design, 
and some of them are marked by lack 
of sufficient skepticism in theoretical 
interpretation. We have noted at 
various points the way in which vari- 
ables have been confounded without 
recognition of the fact. Thus, the 
effects of special conditions of rearing 
upon intellectual performance have 
been described in some cases without 
acknowledgement of the additional 
effects upon emotional stability which 
might well influence performance in 
learning tests. The results of prevent- 
ing a particular response have been 
described without considering the 
confounding effects of deprivation in 
Sensory experience. Natural selection 
in studies involving successive gener- 
ations, speed of eating in measures of 
hoarding, and other variables have 
not always been subjected to control 

in the experimental design. 

, It must be admitted that there are 
instances in this area where inde- 
E control of all the relevant 
in a x is difficult, if not actually 
o le. This is especially true 
i sana to the age variable itself 
lets "i esigned to measure the later 
differen: experimental treatments at 
Tune E levels in infancy. Ana- 
s erences, nutritional re- 
evelopme Sensory sensitivity, motor 
ate clone! nt, and previous experience 
domes Jj interwoven variables with 
Benet usually be controlled 
ir E of each other. Any ef- 
i. : to the different ages at 
Eu eatment is given may be 
9 chronological primacy by 
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itself, but to differences in any one of 
the above variables or combination of 
them. This dilemma does not detract 
from the value of studying such prob- 
lems. Rather, it means two things: 
first, that there are stages in the his- 
tory of every science when investi- 
gators are, for a time, forced by the 
limitations of their knowledge to 
study the interaction of variables, 
rather than isolated ones; and second, 
that such interactions should be seen 
and labeled as such whenever theo- 
retical conclusions are made. 

Because the general thesis that 
early experience does affect later be- 
havior is so often taken for granted 
and assumed without further elucida- 
tion, it may be clarifying to conclude 
with some consideration of the basic 
ideas underlying this general type of 
research. In what ways does early 
experience influence later behavior? 
Broadly speaking, the evidence we 
have surveyed suggests three answers 
to this question. 

Persistence in adult behavior of 
habits formed in early life. The per- 
sistence of acquired behavior through 
time is, of course, a general phe- 
nomenon throughout the life of an 
animal, but there are several reasons 
why habits acquired early in life 
might be especially persistent. When 
a very young animal learns a response 
to a stimulus situation, the occur- 
rence of that response each time the 
situation presents itself may prevent 
the acquisition of other types of be- 
havior which could compete with the 
original one. This appears to have 
been the case in some of the studies 
on pecking in chicks, in which the 
acquisition of an unnatural eating 
response to a spoonful of food pre- 
vented the development of the nor- 
mal pecking reaction (96). The same 
explanation seems applicable to the 
finding that male rats which, in in- 
fancy, have formed habits of respond- 
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ing playfully to receptive females, 
never display completely normal sex- 
ual reactions after they become ma- 
ture (66). 

An additional reason why habits 
learned early in life are sometimes re- 
tained better than those which are 
acquired later involves differences in 
motivation with age. It is possible 
that certain physiological needs and 
their corresponding "drives" are 
more intense in infancy than in adult- 
hood. Younger rats, for example, will 
make many more grid crossings to 
food than older rats tested after the 
same duration of food deprivation 
(81). Rats starved just after weaning 
are therefore probably hungrier and 
have a more intense experience than 
rats starved at a later period. Asa 
result, the effects of early experience 
might be expected to persist because 
of the intensity of the motivation un- 
der which the experience originally 
occurred, while the effects of later ex- 
perience would not persist. It is con- 
ceivable that Hunt's first results 
showing the greater effect of early 
than late deprivation of food on adult 
hoarding of food (59) can be inter- 
preted in this way. A similar case 
could be made for the more lasting 
effects of early trauma. Because of 
the young animal's unfamiliarity 
with its environment, and its lack of 
a learned repertoire of fear-reducing 
responses, fear in infancy is probably 
of a greater intensity than similarly 
produced fear later in life. It is 
known, for example, that incidence of 
seizure reactions to air blasts is a de- 
creasing function of age in rats (30, 
34). In the same manner, defeat in a 
fight, as occurred in Kahn's study of 
mice (67), may be a more intense ex- 
perience for a young animal than for 
an older one; and for this reason, its 
traumatic effects might last longer 
and have a greater influence on subse- 
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quent behavior. 

Early perceptual learning aff 
adult behavior. Quite distinct 
the theory of retention in adul 
of the same overt responses learnedin _ 
infancy is the hypothesis that certain 
types of early experience influence” 
later behavior by structuring the ine 
dividual’s perceptual 


capacities, ' 
Practically all the studies we have 
cited in the fields of sensation and 
learning are based upor this view, 
Mention has already been made of I 
Hebb's analysis of the development. Ji 
of behavior. Early learning is com — 
ceived to consist largely of the estaba 
lishment of “perceptual elements" 
(“phase sequences,” neurologically) 
which serve as the basis of learning 
later in life (52). It is far too early to — 
evaluate the explanatory value of this 
theory, but several studies have - 
shown that absence of normal sensory - 
or motor experience in infancy Cam. 
severely restrict the reactive abilities — 
of adult animals. , 
Critical periods in development. A 
third idea which underlies many re- 
searches involves the concept of spe- 
cific stages in ontogeny during which 
certain types of behavior normaly 
are shaped and molded for life. This. | 
hypothesis may have been suggested 
by certain findings in experimental 
embryology. If at a certain stage mm 
the development of the frog embryo; 
part of the presumptive belly wall is 
transplanted to the region where the 
nervous system is later to develop; 
the transplanted tissue develops into 
a portion of the neural tube—the 
future of the cells thus being deter- 
mined in part by their environment — 
a process which Spemann (121) cal ed 
induction. At a later stage, howevēt 
the cells lose their susceptibility 10. 
such environmental influence. Simi 
larly, there may be relatively briet 
stages in the behavioral development 
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of animals during which the future of 
certain aspects of behavior is strongly 
affected by contemporary environ- 
mental influences that have no such 
effect at other points along the life 
line. 

Applying this theoretical viewpoint 
to the behavioral development of 
birds, Lorenz (78) concluded that 
there are specific and restricted peri- 
ods during which the stimuli which 
will thereafter evoke certain instinc- 
tive responses are permanently de- 
termined. After the critical period 
has passed, the environment cannot 
alter the nature of the effective stimu- 
lus. Studies on imprinting suggest 
that this may be true for the filial re- 
sponses of ground-nesting birds, in 
which case the critical period is re- 
stricted to the first 24 hours after 
hatching (1, 29). 

More recently, the idea of critical 
periods has been applied to dogs. 
Scott and Marston (114) divide the 
development of the puppy into se- 
quentia periods according to the 
kinds of behavior that are possible. 
First is the neonatal period, lasting 
approximately ten days, when the 
animal is both blind and deaf, and 
therefore somewhat isolated from the 
environment, A transition period fol- 
lows to three weeks of age, at which 
Point conditioned responses are said 
to become possible fairly suddenly 
(39). Then comes a socialization 
Tue lasting to ten weeks of age. 
mr described as a critical period 
ie ^ ing the future social adjustment 

t € individual (113). It is followed 
E the juvenile period, and, after 

xual maturity, the adult stage. 
ties E are several obvious similari- 
aa the hypothesized critical 
JA iv in animals and various theo- 

i pe the development of human 
E. Hi . For example, Freud's con- 

Iscrete stages in psychosexual 


development and the lasting effects 
of fixation at one level or another calls 
to mind Levy's demonstration that 
interference with normal oral re- 
sponses in a period when they pre- 
dominate in an animal's behavior 
may influence later behavior of the 
same kind. And Murphy's use of the 
concept of canalization (90), which is 
regarded as an irreversible learning 
process whereby some needs come to 
be satisfied only or predominately by 
certain, specific stimuli, bears a re- 
semblance to Lorenz's theory of im- 
printing. Descriptions of the devel- 
opmental stages in puppies are some- 
what reminiscent of Piaget's account 
of the progressive development of 
social behavior in young children 
(99). It is tempting to suggest the 
existence of similarities in underlying 
mechanisms, but this would be un- 
warranted on the basis of present 
knowledge. 

In listing three possible answers to 
the question of how early experience 
can affect later behavior, we mean to 
imply neither that they are mutually 
independent, nor that they are to- 
gether exhaustive. Rather they ap- 
pear as currents of thought along 
which research in this area has pro- 
gressed. 

In conclusion, it can be said that 
the longitudinal approach to the 
study of animal life is obviously more 
beset with methodological problems 
than studies concerned with only one 
portion of the life span. Experimental 
control in this area is more difficult to 
realize, because the animals, by the 
very nature of the problem, are ex- 
posed to numerous variables, some of 
which may not yet be recognized or 
capable of isolation. These difficul- 
ties, however, are more than offset by 
the general interest and wide signifi- 
cance of the problems that may be 
studied. 


8. BERNSTEIN, L. 
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The purpose of this paper is to 
introduce sequential analysis to psy- 
chological researchers, and more in- 
cidentally, to discuss the operating 
characteristic function for statistical 
tests in general. Sequential analysis 
is not presented adequately in the 
statistical texts most commonly con- 
sulted by psychologists and has 
rarely been used in psychological re- 
search. Yet it is a statistical method 
that has the important advantage 
that it minimizes the average number 
of observations required to reach a 
specific statistical decision; probably 
it will be most useful to research on 
personality and other topics where 
the cost per unit (i.e., per observa- 
tion or pair of observations) is high. 
The approach to statistical problems 
in terms of the operating character- 
istic or power of a test also has re- 
ceived little attention in textbooks on 
psychological statistics (cf. 8, pp. 65- 
67). This approach, which is impor- 
tant in sequential analysis, has gen- 
eral relevance to statistical tests of 
hypotheses. 

Sequential analysis is fully pre- 
sented in a book written by its prin- 
cipal inventor, Wald (14). A brief 
technical presentation can be found 
in Mood (11). A more definitive and 
detailed exposition, presented in the 
context of sampling inspection, is 
also available (3). The chapter dis- 
cussing it in Dixon and Massey (5, 
ch. 18), while presenting sequential 
procedures in somewhat different 
form, may be the most readily avail- 


1 The authors are indebted to Professor W. 
Allen Wallis and Dr. Esther Seiden for their 
valuable criticisms of an early draft of this 
paper. 
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able introduction for psychological 
investigators. 

The few relevant papers using sê- 
quential analysis have essentially 
been demonstrations of the method 
as applied to test construction and 
test administration. Cowden (4) 
gave a final course examination se 
quentially, with each student being. 
required to answer successive sets of. 
questions until a decision could be 
reached concerning his achievement; 
Moonan (12) suggests possible uses: 
of the method for problems involving 
the optimal number of items or for 
spotting extreme students, but con- 
cludes that the method is usually not 
worth while in education, where the 
expense of collecting data is rela- 
tively independent of the number of 
items used. The method was applied 
by Kimball (10) to the problem of 
checking and rescoring test papers. 
The application of sequential sam- 
pling to item selection is shown by 
Walker (15), who, with Cohen (16), 
provides facilitating tables. Schmid 
(13) offers another approach to the 
same problem. Good agreement 1$ 
reported by Anastasi (1) between de- 
cisions about item selection based on 
sequential analysis and those ba: 
on correlations for a total sample. 


THE OPERATING CHARACTERISTIC 
FUNCTION 


The typical elementary text in — 
psychological statistics defines the — 
probability of errors of the first kind, i 
a, and the probability of errors O 
the second kind, 8. Errors of the first { 
kind (Type I errors) are errors of re 
jecting, on the basis of sample ob- ] 
servations, a hypothesis, Ho, when 
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Hp is, in the population, true; errors 
of the second kind (Type II errors) 
are errors of accepting Ho, when in 
the population an alternative hy- 
pothesis, Hı, is true. Having pre- 
sented such definitions, it is the rare 
text in psychological statistics which 
further discusses the importance of 
errors of the second kind. Rather, 
discussions of actual statistical tests 
of hypotheses emphasize confidence 
levels, o errors, to the exclusion of the 
probability of errors of the second 
kind, B errors. 

The operating characteristic (OC) 
curve, for any statistical test of hy- 
potheses, shows the relationship be- 
tween the probability of accepting a 
particular hypothesis and the true 
value of the parameter under con- 
sideration. The height of the OC 
curve at a given parametric value, 
n, represents the probability of ac- 
cepting the specified hypothesis when 
the observed sample of observations 
has been randomly selected from a 
population with the parameter p. 
The use of the OC curve is exempli- 
fied in Fig. 1. 

Let us suppose that we are drawing 
à large random sample from a nor- 
mally distributed population. Let 
us choose a level of significance, 
4—.05, for our test. In Fig. 1 are 
exemplified two OC curves, one for 
each of the following two tests: 
an We wish to test the hypothesis 
Ea the population mean is zero 
ae against the alternative 
The se that u differs from zero. 
bus ght of curve A, in Fig. ie 
w a the probability of accepting 
Een Ho for all possible 
the etric values of u. For example, 
thé Tn population mean is zero, 
this a ability that we will accept 
T docthena is .95, or, in general, 
differ : the true population mean 

$ from zero by three standard 
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*o* t # 
A 


Fic. 1. ILLUSTRATIVE OC Curves FOR ONE- 
TAILED AND TWO-TAILED TESTS 


errors of the mean, the probability 
that we would accept Ho is reduced 
to approximately .16. This value is 
equal to f, the error of the second 
kind, for the particular alternative 
hypothesis p= 37/ /N. In general, 
the height of the OC curve is 8 for 
any point along the abscissa other 
than that point included in the speci- 
fied hypothesis, Ho. 

2. We wish to test the hypothesis 
u=0 against the alternative hypoth- 
esis p>0. The probability of ac- 
cepting this hypothesis, as a function 
of the true parametric mean, is given 
by the height of OC curve B in Fig. 1. 
Again that height at p=0 is .95 or 
1—a. At p=30//N that height is 
08. For the particular alternative 
hypothesis p=30/V/N, the B error 
for this test is half the magnitude of 
the f error for test 1 above. For this 
alternative hypothesis, this test is 
more “powerful” than test 1. In gen- 
eral, the "power" of a statistical test 
is defined as 1 —8 for any particular 
alternative hypothesis; while from an 
OC curve, one reads the magnitude 
of 8; from a “power curve," one reads 
the magnitude of 1—8. 
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If, in this hypothetical problem, 
we- were primarily interested in 
guarding against the alternative hy- 
pothesis u—3e/4/N, then we would 


choose test 2 rather than test 1, thus ~ 


reducing the probability of an error 
of the second kind. Such a choice 
would preserve the confidence level, 
a=.05, but would maximize the 
power of our test. Often, in designing 
statistical tests of hypotheses, we 
have a choice among a number of 
alternative tests. Our choice can be 
optimally resolved if we know the 
OC function for each of the tests, for 
then we can choose that test which 
maximizes the power (minimizes f) 
for a particular alternative in which 
we may be especially interested. 


AN EXAMPLE OF THE 
SEQUENTIAL METHOD? 


The general approach used in 
sequential analysis may now be il- 
lustrated with a hypothetical exam- 
ple. Suppose that we wished to eval- 
uate a new psychotherapeutic tech- 
nique. Let us assume that an accept- 
able criterion for therapeutic success 
is available and that it has been es- 
tablished that the best one of the 
conventional techniques is successful 
in 60 per cent of cases treated. We 
are interested in determining whether 
or not the new technique is more suc- 
cessful than the conventional one. 

To make a test on these bases, we 
must be able to observe results of the 
new technique which may be con- 
sidered random representations of 
a population of results from this 
technique, and which can be reliably 
scored as successes or nonsuccesses. 
Let us assume that these require- 
ments have been met. Our sequential 
plan will consist of examining in se- 
quence single results or groups of 


2 The succeeding presentation closel l- 
lels that in (3). epi 
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results, then of making one of three 
possible decisions: (a) that the new 
technique is no better than the con- 
ventional one, (b) that the new tech- 
nique is better than the conventional 
one, or (c) that we do not have suffi- 
cient evidence to make either deci- 
sion without too great a risk of error. 
If the third decision is taken, another 
result or group of results is observed, 
and on the basis of this additional 
observation, the three decisions are 
reconsidered. Sampling proceeds in 
this manner until we have a basis for 
making either decision a or decision 
b. 

In general, the plan for sequential 
sampling can be expressed in pictor- 
ial form, as in Fig. 2. For this bi- 
nomial problem, the cumulative pro- 
portion of successes is plotted against 
the total number of observations. So 
long as the plotted line remains in 
the region bounded by the two paral- 
lel straight lines, sampling continues. 
Should the plotted line reach the 
upper straight line, the hypothesis 
under test is rejected in favor of the 


Fic. 2. A SEQUENTIAL SAMPLING PLAN FOR 
TESTING HYPOTHESIS ABOUT PROPORTION 
AGAINST ONE-SIDED ALTERNATIVE 


— ——— 
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alternative. Should the plotted line 
reach the lower straight line, the 
hypothesis under test is accepted. 
The slope and the intercepts of the 
two parallel lines are completely 
predetermined by characteristics of 
the statistical test. 

Now let us return to our example. 
From the formulation of the prob- 
lem, one of the hypotheses in which 
we are interested is obvious, i.e., that 
the new technique produces no more 
successes than the old one. In formal 
terms, we wish to test the hypothesis 


.60, 


Hoi t = To = 


where 7 is the proportion of successes 
in the population from which our 
sample is drawn. The choice of an 
alternative hypothesis depends upon 
what we mean by “better than." In 
this case, we might decide that if the 
new technique is better than the old, 
as evidenced by a 15 per cent increase 
in successes, such a difference would 
: of considerable importance; we 
might also decide that a less dramatic 
difference than this might be of ques- 
tionable practical importance. On 
the basis of such considerations, we 
Would choose a second hypothesis 


Hym = m 5. 


We now choose a risk « of rejecting 
Hy when H, is true, and a risk 8 of 
V tng Hy when H, is true. In our 
eo we might choose to make 8 
eae risk of the two, for it 
a be a costly mistake to con- 

ude that the new technique was no 
e than the old, when actually 
EV technique produced 75 per 
5s Ox as against 60 per cent 

€ old. We might then choose 


.10, 
and 
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Having chosen the four values 
Ho, Hi, æ, and B, we have completely 
determined the characteristics of our 
sequential test and are able to con- 
sider the OC curve for the test. To 
draw a rough sketch of a sequential 
OC curve, we are able to make use 
of the following five points, where L 
is the height of the OC curve: 


a. L(x = 0) = 1; 

b. «= L(w =m) =1-— a; 
Hie 

ktk’ 


c. L(r2s)- 


i 1—a à 
og 
kb E AA and [2] 

mifi — To 

re | (2) 
mol- m 
7 

log E 

jy ee D 


m (1 — To 
we [C=] 

To 1 — GE 
A number of additional points on the 
OC curve can be found; for a thor- 
ough explanation of methods of de- 
termining the OC curve, the reader 
is referred to the report of the Sta- 
tistical Research Group, Columbia 
University (3). 

In the present example, the oc 
curve is as sketched in Fig. 3. We are 
now able to visualize the characteris- 
tics of our sequential test, in terms of 
the OC curve, and to decide whether 


> 
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Fic. 3. THE SEQUENTIAL OC Curve 
FOR BINOMIAL EXAMPLE 


these characteristics are satisfactory 
for our purposes. If they are not, we 
alter Hy or H, or « or B in such a 
manner as to make the test more 
suitable. If the characteristics are 
acceptable, we proceed further in de- 
termining the sampling plan. 

Referring to Fig. 2, the decision 
regions are completely defined by a 
a value for the slope of the two parallel 
lines, and by an intercept value for 
each of the two lines. In general, the 
equations of the two parallel lines 
may be written 


fo = 


—ko+sn [4a] 
ki + sn, [45] 


where r stands for the number of suc- 
cesses, ? the cumulative number of 
observations, s the slope, and — ko 
and kı for the intercepts. It can be 
shown that s, ko, and k; are the values 
defined previously in equations [1], 
[2], and [3] (3, sec. B.072). For the 
present example, £$4—4.170, ky 
—3.248, and s —.678. Our sequential 
plan is that represented by the two 
parallel lines of Fig. 2. 

To illustrate an application of se- 
quential analysis to a binomial prob- 


ry 
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lem, consider the following 
experiment. Observations are se 
lected from a table of random num 
bers. Assuming a population z of 80, 
we record any digit from 1 to 8a suc 
cess (a '"1"); digits 9 aud 0 are re 
corded as failures (“0”). A sequence - 
of such observations has been made, 
and the results are presented in the 
order observed: 


101111110111 122 


This sequence of observations is 
graphed in Fig. 2; since the plotted 
line reaches the upper straight line on 
the sixteenth observation, sampling. 
stops at that point, and we reject He 
in favor of Hı. We conclude it to be 
more likely that +=.75 than that 
s —.60, the confidence in the conclu- 
sion being specified by the probability 
of Type I error, a=.10. 


THE AVERAGE SAMPLE SIZE 


In addition to the OC curve for à 
sequential test, there is another curve 
of considerable interest. This is à 
curve of average sample size (or aver- 
age sample number, ASN) as a func- 
tion of the true population parameter. 
It is useful to refer to the ASN curve 
to determine the expected cost, Im 
terms of sample size, of achieving 
that assurance of a correct decision 
selected from the OC function. 

The average sample size depends 
upon « and 8; the smaller these risks, 
the greater the size that the sample 
will usually be before a decision 15 
reached. But the ASN also depends 
upon the hypothesized values mo aM 
Tı, and upon the population propor- - 
tion of successes, m. For given values 
of o, B, To, and m, we can write the 
ASN as a function of m. If 7 corre 


E 


Ss D 


ASN (r = m) = 


sponds to the hypothesized propo 
tion, To, we have 
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lí x corresponds to mı, 
(1 — 8)h — Bko 
Ó———— P 


10734 


ASN (v = m) = 


where s, ko, and & are those values 
given in equations [1], [2], and [3]. 
These two values, ASN (x=70) and 
ASN (r-7,), are larger than any 
average sample size necessary to 
make a decision if m is not between 
rs and 7. 

Three additional points on the 
ASN curve may be found to be: 


ASN (r = 0) = LT 
$5 
ASN (r = $) = hh 
s(1 — s) 
a h 
ASN (x 1) 1 
1—5 


Thus, just as we were able to sketch 
the OC curve from five points, given 
any sequential plan, we are able to 
sketch the ASN curve from five 
points. The ASN curve for our earlier 
example is given in Fig. 4. For any 
m pia it is not difficult to 
ve for a larger number of points on 
the ASN care (3). ae 


Some OTHER SEQUENTIAL TESTS 


Requirements regarding untested 
Parameters. The simpler forms of se- 
quential test assume that either the 
E or the mean is known. Tests 
ne means and differences be- 
Een means assume. that the vari- 
a known. Similarly, the con- 
^m 5x ne of hypotheses involving 
the Él ard deviation assumes that 
eg is known; a more laborious 

ES is available when it is un- 
he (This test for the standard 

Ms ion is presented in [3, 5, 14].) 
Hes euer involving means. Appli- 
E. of the basic equations for test- 

8 hypotheses concerning means as- 
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ASN 


J0 10 
Ls 


Fic. 4. THE ASN CURVE FOR 
BINOMIAL EXAMPLE 


sumes that observations are drawn 
from a normally distributed popula- 
tion. These equations, defining the 
parallel decision lines, are 

ui — M uy — uw? 

mI MEL ER 


g? a° 


[5a] 


; and 


= log. 

à 1—a« 

da — Uo uy — h? 
—— — 2X; 

o? * 20? 


Bey 


n 


[55] 


where uo and u are the means stated 
in our two hypotheses, Ho and £s, re- 
spectively, X; is a single observation, 
g? is the variance of X, 1 is the num- 
ber of observations, and a and f are 
the specified probabilities of Type I 
and Type II errors. Since these for- 
mulas involve the variance, the ex- 
perimenter must either have a good 
estimate of the variance from other 
data or must take an initial group of 
cases (n) of sufficient size to yield a 


= loge 
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reasonably reliable estimate (s*) of 
the variance. If the estimate is too 
small (or too large), he will be un- 
wittingly increasing (or decreasing) 
the values of a and 8 and thus will 
make his test less (more) sensitive 
than he has planned to have it. 

By solving formulas [Sa] and [5b] 
for ZX,, we obtain the following equa- 
tions: 

B (e?) 
IX: = log, ——— — —— 
Tra Un — m) 
+ are, 


1-8 (e?) 


(us + 140) 
2 


These are the equations for the 
parallel lines in a graph representing 
the sequential plan for testing hy- 
potheses involving means. To carry 
out the test, we plot on the graph the 
observed EX, for each value of m. 
When the observed cumulative sum 
falls outside one of the lines, the 
appropriate decision is made. 

Difference between means. A con- 
venient sequential procedure is avail- 
able for testing the significance of 
the difference between means of two 
equal-sized samples. In almost any 
instance where the sequential ap- 
proach would be preferred to a con- 
ventional method based upon fixed 
A's, the use of equal-sized samples is 
not a serious restriction. 

The procedure involves the use of 
difference scores between randomly 
selected pairs, or matched pairs if the 
groups are matched case for case. 
The sum of these difference scores is 
plotted on the figure specified by 
formulas [6a] and [6b], with xo set at 
zero and u; determined by theoretical 


and [6a] 


H [65] 
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or practical considerations. Once 
again, the procedure assumes that the 
variance is known. The variance used 
here is, of course, the variance of the 
difference between means (for corre 
lated or uncorrelated samples, which- 
ever is appropriate). 

Two-sided hypotheses involving 
means. All the preceding tests were 
one-sided; for example, the tests in- 
volving means specified just two val- 
ues, «o and tı, Even the test of mean 
difference required a single specific 
value for the alternative hypothesis. 
The procedure outlined below is de- 
signed to test whether the mean of a 
group deviates in either direction 
from some specified value. Thus in- 
stead of setting one value for the 
alternative hypothesis, we set two, 
which deviate the same amount but 
in opposite directions from the hy- 
pothesized central value, o. (For a 
full account, see 3.) 

Let d be the deviation of each of 
the two means, specified under the 
alternative hypothesis, from the 
mean, uo, specified in the null hy- 
pothesis. Let our critical values be 


8 ; and 


a = log. 


B 
b — log, 
: 1—2o 
The formula required for the sequen- 
tial test of significance is 


4 zQ-w)| 4" gj 


g? 207 


G=log, cosh 


where z is the number of observations 
(as previously), X; is any observ! 

score, g? is the true variance of the 
population from which the observa 
tions are drawn, and cosh refers tO 
the hyperbolic cosine. Tables of the 
hyperbolic cosines are accessible (e.£ 
2, Table XV; 3, Table 5.2). If GiS 
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greater than a, the null hypothesis is 
rejected in favor of thealternative 
hypothesis that the mean of the 
group differs from o by the amount 
d. If Gis more negative than b, we 
accept the null hypothesis that the 
mean of the group does not differ 
from uo. 

For most values, it is unnecessary 
to use the tables for the hyperbolic 
cosine because log, cosh s is asymp- 
totic to z—log, 2 or s—.693. The 
equivalence is exact to two decimal 
places if 222.7. Hence we may 
modify formula [7] as follows: 

d| E(X: — 1«)] 


2 


g? 
2 


d 
— — n—.693. 
2 


2o 


. [8] 


We set formula [8] equal first to a and 
then to b, and solve for |Z(X;—w)|: 


| Z(X; Es u) | 


d : 693 
Eni ver, and [9a] 
| zar, - 1) | 
Wa e*(b + .693) 
sinp A o 


ie equations yield the parallel 
c in Fig. 5, which has been set up 
"d the following example: 1-10, 
E d=6, a=.10, 8—.05. Data 
i three sampling experiments are 
EM on Fig. 5, the values being ob- 
ined by drawing random sequential 
. ‘amples from normally distributed 
Populations with «=9, 16, and 20 re- 
Spectively. 
An important consideration must 
i mra Equations [9a] and 
is E based upon equation [8], 
^ is an adequate approximation 
€ precise equation [7] so long as 


322.7, where 
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Fic. 5, A SEQUENTIAL PLAN FOR TESTING 
Hypornests ABOUT MEAN AGAINST Two- 
SIDED ALTERNATIVE 


V" 4 ZO) 
m 


For z«2.7, equation [7] must be 
applied. It is sufficient to evaluate 
three or four values of G, correspond- 
ing to values 0<s<2.7, from equa- 
tion [7]. These values provide points 
on the curved portion of the lower de- 
cision line of Fig. 5, and allow the 
necessary correction to become part 
of the graph of the sequential plan. 

In Fig. 5, the sampling experiment 
for u=20 gives a value of 70.32 at 
n=6, allowing us to reject Ho:u=10, 
in favor of Hy:u=10+6. The sam- 
pling experiment for «= 16 allows the 
same decision at n=4. For u=9, it 
has required an n of 17 to yield a de- 
cision, in this case acceptance of Ho 
(with probability of erroneous deci- 
sion, 8 —.05). 

The application of this method to 
a two-sided test of the difference 
between two means is clear-cut. 
Once again we take pairs of observa- 
tions, one from each group, and 
analyze the differences for the z pairs. 
The value of wo is of course zero. 

Hypotheses involving correlations. 
No convenient method is currently 
available for utilizing sequenti. 
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analysis in tests of hypotheses in- 
volving correlations. (It is possible 
that the labor involved in the compu- 
tation of correlations may offset the 
advantages of sequential design.) 
While the product-moment correla- 
tion coefficient, r, is a mean, the mean 
cross product of the pairs of observa- 
tions in standard score form, the 
sampling variance is a function of the 
population correlation and therefore 
would differ for two hypothesized 
correlation coefficients. 

In many cases, a relationship can 
be measured by some statistic other 
than r. However, the nonparametric 
statistics, rho and tau, are unsatis- 
factory for sequential tests. For each, 
as for r, the variance is a function of 
the relationship in the population. 
Furthermore, the sampling distribu- 
tions of rank coefficients for the non- 
null case are complex; the approxi- 
mation in the case of tau (9, chs. 4 
and 5) seems too crude for use in se- 
quential tests. 

In this connection, Wolbers (17) 
has suggested a procedure for esti- 
mating correlation coefficients as data 
accumulate. In the method he sug- 
gests, each variate is dichotomized at 
the median. He provides a table for 
estimating the relationship from the 
number of cases in the accumulated 
sample and the number of cases falling 
on different sides of the two medians 
(e.g., above the median on X and 
below the median on Y). However, 
this method yields an estimation. It 
is not a sequential test of hypotheses. 

A general test of hypotheses involving 
double dichotomies. A sequential test 
is available for determining which of 
two groups has the higher proportion 
of some specified condition. This 
test applies to the general case of test- 
ing differences in proportions of 
"successes" in two samples. It could 
have been used for an experiment on 


the problem involved in our original 
example, that of comparing the pro- 
portions of successes produced by 
two therapeutic techniques. Patients 
would be assigned randomly to two 
groups, one for each type of therapy, 
and the proportions of successes in 
each would be ascertained empirically. 

Such a test, for difference between 
proportions, should be contrasted 
with that test applied in the earlier 
example to test the significance of the 
deviation of a sample proportion from 
an a priori value. Since the test 
model for double dichotomies is more 
complex than the methods previ- 
ously discussed, the reader is referred 
to the original source for details (3, 
sec. 3). 


GENERAL REMARKS 


The selection of the alternative hy- 
pothesis. The alternative hypothesis 
tested in a sequential test specifies a 
parameter other than that given by 
the null hypothesis. How does the 
experimenter select this value? We 
may distinguish two general situa- 
tions. In the first situation the ex- 
perimenter can specify a value for Hi 
on one of three particular bases. (a) 
He may be testing a theoretical esti- 
mation of the parameter. Rarely 
would this basis be available to the 
psychologist, since current theory 
does not seem sufficiently precise. 
(b) He can choose a value on the basis 
of previous empirical findings. (c) He 
can decide on practical grounds that 
the parameter must be at or beyon 
the specified value because lesser vat- 
ues would make no practical differ- 
ence. Some of the earliest applica- 
tions of sequential analysis were to 
problems of sampling inspection 
where the parameter presumably was 
chosen on such grounds. 

But another general situation may 
exist where none of the above bases 
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for decision is available or appropri- 
ate. The experimenter may wish to 
use a sequential test for the sole 
purpose of economy of effort. What 
rationale can he follow? For these 
circumstances, we suggest a plan 
based upon a rough approximation to 
the largest number of observations 
that the experimenter is willing to 
use. Having chosen a trial Hı such 
that for selected values of a and 6 the 
OC curve is satisfactory, the experi- 
menter can compare the highest point 
on the ASN curve with his preferred 
maximum N. If the maximum ASN 
exceeds the preferred maximum N, a 
second value of Hy, more distant from 
Hy than the first, may be chosen and 
new OC and ASN curves drawn. 

f The chance that the actual sample 
size required to make a decision will 
exceed the maximum ASN is small, 
definitely below .5. Usually one of 
the hypotheses will be accepted long 
before the sample size reaches the 
maximum ASN, and often when the 
number of observations is less than 
half that size. 

It is obvious that the closer to- 
gether the values postulated in Ho 
and H,, the larger the ASN (cf. Fig. 
4). As in conventional statistics, 
small effects are more difficult to 
isolate than large ones. On the other 
hand, if the difference between the 
two hypothesized values is unreason- 
m large, the experimenter gains 
ittle by deciding to accept one hy- 
pothesis in preference to the other. 
im when an alternative hypothe- 

can be determined on a priori or 

Bartica! grounds, it is advisable to 
onsider the OC and ASN curves in 
Sia a sequential test. Only in 
eed can an experimenter be com- 
hich. assured that his design is one 

B y will oe the test he 
ate ASN) acceptable cost (moder- 


Sequential tests vs. conventional 
tests. Does a sequential test have any 
advantages when the selection of a 
second hypothesis is done on an essen- 
tially arbitrary basis? The chief ad- 
vantage is economy. Sequential 
analysis produces an average saving 
in sample size of about one-third to 
one-half, when compared with the 
most powerful current statistical 
tests. However, the experimenter 
using sequential analysis will use 
more cases about 25 per cent of the 
times he uses this test if a=B=.05. 
(Rarely will he have to use twice as 
many cases.) If a=8=.01, the prob- 
ability is about .9 that he will effect 
a saving (cf. 14, p. 57 and p. 60). 

This advantage can be seen intui- 
tively in our first example. Here, the 
two hypotheses stipulate the pro- 
portion of successes at .60 and .75 re- 
spectively. If our first eight randomly 
selected cases had included no failure 
case, it would be very unlikely that 
the observed sample p subsequently 
would drop to a value so low as to 
lead to acceptance of H»:7 — .60. On 
the other hand, if six of the eight 
were failures, it would be unlikely for 
subsequent sampling to produce a p 
so high as to lead to acceptance of 
Hy:r=.75. Such extreme results are 
uncommon. However, sets of obser- 
vations showing some tendency to- 
ward an extreme occur sufficiently 
often to make the method economi- 
cal. Sequential analysis takes advan- 
tage of the fact that some possible 
sets of n observations make improb- 
able (or impossible) certain obtained 
values for n+1, n+3, or even 3n ob- 
servations. 

An evaluation of the sequential de- 
sign. The uncritical general use of 
sequential analysis obviously is not 
recommended. It is a design which 
can have advantages when one or 
more of the following conditions 
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holds: (a) The problem involves the 
choice between two possible parame- 
ter values which can be specified on 
a priori but not arbitrary grounds (see 
“The Selection of the Alternative Hy- 
pothesis," above). The null hypothe- 
sis will usually be one of the two. (5) 
The data are such that the cost per 
datum is high and economy is de- 
sired. (c) The total amount of data 
is not fixed. In this situation, the 
experimenter takes a calculated risk. 
We have seen above that in a small 
proportion of applications of sequen- 
tial tests, he will require more cases 
than he would if he used a more con- 
ventional statistic. However, the 
odds are great that he will economize 
on his data collection. 

As Wallis (7, ch. 17) points out, in 
the nonsequential test, N is fixed, 
and with one risk specified (a), the 
second risk (8) is minimized. In the 
sequential test, we specify both risks 
(a and 8) and minimize the sample 
size. 

Inasituation where we are able to 
specify both a and 8, but where it is 
uneconomical to collect observations 
piecemeal, it may be preferable to 
estimate the desirable sample size 
and proceed with a conventional 
study, collecting and analyzing all 
data at the same time (cf. 5, ch. 14, 
especially pp. 212-213 and pp. 219- 
220; also 6, pp. 153-157). For this 
latter situation, the total effort re- 
quired to reach a decision can be esti- 
mated in advance. In a sequential 
test, such an estimate will necessarily 
be only a rough approximation. 


SuMMARY 


This paper seeks to introduce se- 
quential analysis to psychological re- 
searchers as a statistical design that 
has considerable promise for work in 


areas where the cost perobservation 
is high. 
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The basic design involves the test- 
ing of alternative hypotheses, one of | 
which is usually a null hypothesis. 
The experimenter sets up three al- 
ternative decisions, one of which is ac- 
cepted after each set of data is ana- 
lyzed: (a) he may accept the first 
hypothesis; (b) he may accept the 
second hypothesis; (c) he may accept 
neither and take another observation. 
If he makes the third decision, he 
makes another observation (or set of 
observations) and then repeats the 
analysis. This procedure is followed 
until he accepts one hypothesis or the 
other. 

In its more convenient forms, the 
sequential test assumes knowledge of 
the population variance or (in testsof 
variance) knowledge of the population 
mean. The applications of the statis- 
tical design to problems involving 
proportions, means, and mean dif- 
ferences are illustrated. Methods are 
given for sketching operating charac- 
teristic and average sample number 
curves for sequential tests. The use 
of these curves is exemplified and 
their value is emphasized. 
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SPEED OF LEARNING AND AMOUNT RETAINED: 
A CONSIDERATION OF METHODOLOGY 
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A common generalization found in 
textbooks is that the fast learner 
retains more than the slow learner. 
McGeoch (6), apparently on the basis 
of Gillette's work (1) and on the basis 
of his general conception of learning 
and forgetting processes, wrote: 
“This high positive relation between 
individual scores in learning and re- 
tention is to be expected, of course, 
from the fact that these are continu- 
ous processes” (6, p. 388). Although 
not explicitly stated in terms of rate 
of learning and forgetting of individ- 
uals, the present writer has also as- 
sumed the validity of the general 
principle that speed of learning and 
amount retained are directly related 
(8, p. 510). Kingsley (4, p. 469), Munn 
(7, p. 213), and Hilgard (2, p. 10) 
have also accepted the generalization. 

The present paper has two pur- 
poses. First, we will show that the 
methods used to obtain the data from 
which this principle was derived are 
inappropriate. Secondly, we will 
show that when suitable methodology 
is applied to the problem, there is no 
difference in rate of forgetting of fast 
and slow learners. 


METHODS USED By PREVIOUS 
INVESTIGATORS 


The last extensive work on the 
problem of the relation between speed 


1 Much appreciation is due Jack Richard- 
son, my research assistant. He has helped 
with the analyses and is entirely responsible 
for fitting the curves of Fig. 1. He has also 
critically read the manuscript. 

2 The work was done under Contract 
N7onr-45008, Project NR 154-057, between 
Northwestern University and the Office of 
Naval Research. 
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of learning and amount retained was 
Gillette's monograph (1) published 
in 1936. It is highly probable that 
this monograph is responsible for the 
general conclusions appearing in text- 
books as noted above. In this mono- 
graph Gillette critically summarizes 
the methods and results of all previ- 
ous investigations, going back as far 
as Ebbinghaus. Two methods of in- 
vestigation had been used in these 
previous investigations. Gillette's 
criticisms of these methods may be 
noted briefly. 

Method of equal amount learned. By 
this method all Ss learn a task to à 
given criterion of proficiency, and 
after a rest interval retention meas- 
urements are obtained. The most 
commonly used statistic is the corre- 
lation coefficient between a measure 
of time to learn to the standard cri- 
terion and the recall or relearning 
scores. Gillette points out thatthis 
technique favors retention of the slow 
learner since certain of the items oF 
parts of the task will be much over- 
learned and therefore might be ex- 
pected to enhance retention. This 
would tend to reduce differences 1n 
retention between fast and slow 
learners. Indeed, correlations neat 
zero between time to learn an 
amount recalled are interpreted by 
Gillette as actually indicating the 
slow learners have forgotten more 
rapidly than fast learners. Neverthe- 
less, Gillette rightfully rejects this 
method as a means of reaching 4 
definitive conclusion on the issue. 

Method of equal opportunity to learn. 
By this method all Ss are allowed the 
same amount of time to study the 
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material before the retention interval. 
Again, Gillette notes that this method 
is quite inappropriate because the 
fast learners will learn more than the 
slow learners and therefore may be 
expected to have higher retention 
simply because they have learned 
more. In general, the studies she re- 
views show fairly high correlations 
between amount learned and amount 
retained, and her own data resulting 
from this technique likewise yield 
high correlations. Actually, of course, 
using this technique, fast Ss could 
show more rapid forgetting than slow 
Ss, and the high correlations would 
still obtain if there were wide differ- 
ences in amount originally learned. 
This method, therefore, must be re- 
jected as a means of investigating the 
problem. 

Method of adjusted learning. Gil- 
lette's solution to the methodological 
problem was to use the adjusted- 
learning technique. By this technique 
items in a list are “dropped out” after 
being learned to a minimum criterion 
(such as one correct anticipation). 
Thus, all items of a list are given the 
same number of “reinforcements.” 
And, while slow learners would take 
more time to obtain these reinforce- 
ments than would fast learners, Gil- 
lette assumed that the strength of 
associations at the end of learning was 
the same for fast and for slow Ss. 

Gillette's conclusion from her ex- 
periment using the method of ad- 
Justed learning is as follows: ‘The 
E siad when given sufficient 
EN. earn the same amount as the 
B. ida but not allowed to over- 
E: e material, is not able to re- 

in as much as the fast learner” (1, 
E» Her final conclusion, after 
d all three methods, was that 
e 3 very clear indications that 
ihe bs learner retains better than 

Ow learner, 
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INADEQUACY OF THE METHOD OF 
ADJUSTED LEARNING 

The inadequacy of the method used 
by Gillette can best be understood if 
a brief résumé is given of certain cal- 
culations made on data which were 
available to us. As a result of a series 
of investigations on rote learning 
which we have published elsewhere 
in connection with other problems (9, 
10, 11, 12), we had a large amount of 
data which could be analyzed in an 
attempt to obtain more information 
on the relationship between speed of 
learning and amount retained. These 
experiments involved both paired- 
associate and serial learning, and both 
nonsense syllables and adjectives. 
The lists were always learned to a cri- 
terion of one perfect recitation with 
recall and relearning occurring after 
24 hr. 

Using these data, we first obtained 
correlations between trials to learn 
and recall, and between trials to learn 
and relearn. Since our Ss all learned 
to the same criterion, the method is 
the one referred to above as the 
method of equal amount learned. In 
general conformance with previous 
findings summarized by Gillette, cor- 
relations between time to learn and 
amount recalled were all near zero. 
Correlations between time to learn 
and time to relearn were high and 
positive. Viewed by this method, 
then, our results are quite in accord 
with previous findings. 

Our next step was to analyze the 
data in such a fashion that we would 
in effect be using the adjusted-learn- 
ing technique. We divided a group of 
Ss for a given experiment into two 
equal subgroups based on number of 
trials to learn, thus giving a group of 
slow learners and a group of fast 
learners. We then counted the num- 
ber of correct anticipations made by 
each SS for each item in learning a list 
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to one perfect trial. Each correct 
anticipation we call a reinforcement 
simply because the greater the num- 
ber of correct anticipations the more 
resistant is an item to forgetting. All 
items having the same number of re- 
inforcements were pooled, using sepa- 
rate pools for slow and fast Ss. Thus, 
we might have 200 items which had 
been given five reinforcements while 
being learned by slow Ss and 225 
items which had been given five rein- 
forcements while being learned by 
fast Ss. The final step was to deter- 
mine the percentage correct at recall 
for each frequency of reinforcement 
for fast and slow Ss. This is, in effect, 
comparable to Gillette's adjusted- 
learning technique. 

Results of such comparisons be- 
tween fast and slow Ss for several ex- 
periments showed consistently that 
for items equated for number of rein- 
forcements during learning, retention 
was consistently superior for fast 
learners. This was true regardless of 
the actual number of reinforcements 
obtained during original learning. 
That is, fast Ss were superior in recall 
for items having only a few reinforce- 
ments and for items having many 
reinforcements. Thus, this analysis 
gave strong empirical confirmation to 
Gillette's findings. 

The assumption on which accep- 
tance of Gillette's method (and of 
our modification of it) rests is that 
the making of a correct anticipation 
(a reinforcement) by a slow S results 
in the same associative strength as a 
reinforcement for a fast S. If this 
assumption is not valid, comparisons 
of retention of fast and slow learners 


are as misleading by this method as ` 


by the other methods thus far dis- 
cussed. For, if a reinforcement adds 
more strength to an association being 
acquired by a fast S than it does to 
one for a slow S, it means that degree 
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of learning of the material is not 
equal before introduction of the re- 
tention interval. Comparisons of re- 
tention, therefore, are distorted in the 
same fashion as with the other two 
methods discussed above. 

In order to make comparisons of 
retention of fast and slow learners, it 
is absolutely essential that the re- 
sponse strengths of the items for fast 
and slow learners be equated at the 
end of learning—at the start of the 
retention interval. The method of ad- 
justed learning and our variation on 
this method do not assure that this 
is the case. Indeed, it is a very 
reasonable hypothesis that the essen- 
tial difference between fast and slow 
learners is that a reinforcement does 
result in more associative strength 
for a fast S than for a slow S. Thus, 
the better retention of fast Ss as 
found by Gillette and also by our 
method may be entirely a function of 
the unequal degree of learning before 
the retention interval was introduced. 


TECHNIQUE FOR EQUATING DEGREE 
OF LEARNING FOR FAST AND 
Stow SUBJECTS 


The solution to the problem results 
from what we call a successive prob- 
ability analysis of learning. The data 
we will use to demonstrate the 
method come from five experiments 
on the learning and retention 0 
paired nonsense syllable lists (10). 
Each experiment employed 36 $8. 
We have divided these into two 
groups of 18 each based on number o 
trials to learn, Each S learned and 
recalled three lists, the conditions for 
learning differing only in terms 0 
intertrial interval. These intervals 
(4, 30, and 60 sec.) produced no dif- 
ference in learning, and they do not 
interact with S’s ability level. There- 
fore, we may treat them as three 
equivalent learning tasks for each à. 
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Since there were five experiments, 
and since we get 18 fast Ss and 18 
slow Ss from each experiment, we 
have a total of 90 fast and 90 slow 
Ss, each having learned and recalled 
threelists. Effectively, then, we have 
270 lists for each group, and since 
each list had 10 pairs, we have 2,700 
items for each group. 

The successive probability analysis 
consists of determining the growth 
of the associative function for each 
item. We first observed when an 
item was correctly anticipated ini- 
tially. We then noted whether the 
response was anticipated correctly or 
incorrectly on the immediately fol- 
lowing trial. When an item had been 
anticipated correctly twice (whether 
on successive trials or not), we noted 
whether it was correct or incorrect on 
the next trial. This continued until 
the item was analyzed for the entire 
course of learning. Asan illustration, 
Suppose that an item was first given 
Correctly on trial 6. On trial 7 it was 
also given correctly, on trial 8 incor- 
rectly, trial 9 correctly, trial 10cor- 
rectly, with S also getting allother 
items correct on trial 10. Our entries 
would be: after 1 reinforcement, it 
Was correct on next trial; after 2 
reinforcements it was incorrect on 
next trial; and after three reinforce- 
ments, correct, 
^ pe E an analysis is made for 
Ww m for a large number of Ss, 
etn oe these data are pooled, we 
liu on to make exact state- 
ake he probability of getting an 
‘hae, i on a succeeding trial 
itipetet item has been correctly an- 
hice, th on previous trials once, 

» ree times, etc. 
B Tem be seen, the labor involved 
analysis € this successive probability 
somewhat great. To reduce the labor 
Which Si we eliminated some items 
re very easy to learn and 
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hence had been given a great many 
reinforcements. Working first with 
data for slow Ss, we eliminated all 
items which had been correctly an- 
ticipated more than 31 times by S in 
learning a list. Out of the 2,700 items, 
289 were eliminated by this criterion. 
We then made a distribution of num- 
ber of reinforcements given the 2,700 
items by the fast Ss. On the assump- 
tion that easy items for slow Ss would 
also be the easy items for fast Ss, we 
eliminated approximately 289 of the 
easiest items for the fast Ss. The 
actual number was 322 since we did 
not want to eliminate a portion of a 
category, as would have been neces- 
sary if we had eliminated exactly 289 
items. For these fast Ss, the elimina- 
tion of 322 items left only items 
which had been reinforced less than 
18 times. 

The successive probability curves 
for fast and slow Ss are plotted in 
Fig. 1. The number of observations 
decrease as the number of reinforce- 
ments increase. For the fast Ss a total 
of 18,379 observations are involved, 
with 2,372 for the case of one rein- 
forcement and 54 for the 17 reinforce- 
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ments. For the slow Ss 31,560 
observations were totalled, with 2,403 
for one reinforcement and 28 for 30 
reinforcements. The amount of labor 
involved in compiling the data for the 
curves has some compensation in the 
smoothness of the functions. 

From Fig. 1 it is clear that a rein- 
forcement on a given trial (or series 
of trials) does not result in equal 
probabilities that that response is 
correct on the next trial for fast and 
slow Ss. Following one reinforce- 
ment, for example, the probability is 
.62 for a fast S that the item will be 
correctly anticipated on the next trial. 
For the slow Ss the value is .50. Such 
differences occur throughout the en- 
tire range of reinforcements plotted. 
It is thus apparent that studies which 
have “equated” fast and slow learn- 
ers by equating number of reinforce- 
ments have not equated for associa- 
tive strength. "Therefore, our next 
step is to equate associative strength 
at the end of learning so that com- 
parable retention measurements may 
be made for fast and slow .Ss. 

It can be seen from Fig. 1 that the 
equation of associative strength could 
be made graphically. Thus, we see 
that six reinforcements for slow .Ss 
result in about the same associative 
strength during learning as do three 
reinforcements for fast Ss. Other 
comparable values could be deter- 
mined and two base lines plotted, one 
for fast Ss and one for slow Ss, in such 
a fashion that associative strength at 
the end of learning is equal for the 
two groups at any point along the 
base lines. Such adjustments would 
allow us to say that if these Ss had 
been measured on an immediately 
succeeding trial following the end of 
learning, the associative strength for 
fast and slow Ss would have been the 
same. Knowing this, we can then 
make observations of recall with con- 
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fidence that differences in retention. 


area function of differences in ratesof 1 


forgetting of fast and slow Ss, and not 
a function of differences in response 
strength at the end of original learn- 
ing. 

In general, the above procedure has 
been followed. The critical step, of 
course, is to equate response strengths 
along the base line for fast and slow 
Ss. To do this we first gave mathe- 
matical expression to the curves of 
Fig. 1. After trying several types of 
curves, it was found th .t the expo- 
nential-type function ; ve the best 
fits. Essentially the formula is that 
of Hull (3, p. 119), with the exception 
that we have not forced the curves to 
pass through the origin. The two 
formulas are: 


Fast Ss: H = 95.6 — 35.20e~ 
Slow Ss: H = 92.5 — 40.91£-. 99S, 


In these expressions H is associative 
strength. The first values on the 
right-hand side (95.6 and 92.5) are 
asymptotes estimated from Fig. 1. 
The notation e is a constant given the 
value of 10, and N is the number of 
reinforcements. These formulas fit 
the data well except for the first point 
on each curve. Details of the proce- 
dure used in deriving and working 
with these curves may be found in 
Lewis (5, pp. 61-62). 

To construct exactly comparable 
base lines using the above formulas We 
used the curve for fast Ss as a refer- 
ence curve. The H values for each 
successive N for fast Ss were substi- 
tuted into the formula for slow Ss am 
the equation solved for N. is 
means that we could determine the V 
value for slow Ss which gave the same 
H value for a given N value for fast 
Ss. Thus, for each H value we have 
two N values, one for each group. 
two N values for successive H values 
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are plotted on coordinated base lines 
in Fig. 2. At any point along these 
base lines we know that the proba- 
bility that a response will be correct 
on an immediately succeeding trial is 
the same for both groups. In Fig. 2 
the highest N value for fast Ss is 10, 
this being almost equivalent in associ- 
ative strength to N of 30 for slow Ss. 

The final step is to plot the propor- 
tion of items recalled after 24 hr. for 
each group for each level of associa- 
tive strength. When this is done 
(Fig. 2) we see that no consistent dif- 
ference exists between the two curves. 
We conclude that when associative 
strength at the end of learning is 
equivalent for slow and fast Ss, no 
difference in recall after 24 hr. may be 
expected. This holds true for a wide 
range of associative strengths. Be- 
cause forgetting is rather great in 
both groups, and because it does not 
vary between groups as a function of 
associative strength, we would not 
expect differences to emerge with 
longer retention intervals. 


OTHER IMPLICATIONS 


The results of the above analyses 
lead to a strong temptation to assert 
that the critical difference between 
fast and slow Ss is that the associa- 
tive strength resulting from a rein- 
forcement is less for slow than for fast 
learners, Ina descriptive sense this is 
true, as indicated by Fig. 1. However, 
in the theoretical sense some reserva- 
tion is necessary. Between successive 
pe entations of a given item during 

arning, a maximum of about 1.5 min. 
elapsed. This interval might be as 
rd as a few seconds depending upon 

e location of the item in successive 
Presentations of the list and upon the 
intertrial interval used. It might be 
argued that if there is no difference in 
rgetting between fast and slow Ss 
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NUMBER OF REINFORCEMENTS DURING LEARNING 


Fic. 2. RETENTION OF Fast AND SLOW 
LEARNERS AFTER 24 HR. WHEN ASSOCIATIONS 
ARE EQUATED FOR STRENGTH AT THE END OF 
LEARNING 


over 24 hr., it is highly unlikely that 
there are differences in forgetting over 
these very short intervals between 
successive presentations of the same 
item during learning. Nevertheless, 
it is possible that a reinforcement for 
a fast and a slow S adds the same 
associative strength, but that in the 
context of learning a group of items 
(as contrasted with the relative rest 
over 24 hr.) more forgetting might 
take place for an item for slow Ss 
than for fast Ss over these short 
intervals between trials. Thus, it 
must not inevitably follow that rein- 
forcements add different associative 
strengths at the time they occur for 
fast and slow Ss; we believe they do 
add or produce different amounts of 
associative strength but our data are 
not conclusive on this matter. We 
can conclusively say only that when 
probabilities of response for succes- 
sive trials during learning are equated 
for fast and slow Ss, no difference in 
forgetting occurred. 

The relationships between other 
variables and forgetting may, if ana- 
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lyzed by the present technique, be 
different from those commonly be- 
lieved to be true. However, such 
analyses may be expected to be fruit 
ful only if differences produced by the 
variable during learning are fairly 
large. For example, it is generally be- 
lieved that the more meaningful the 
material the less rapid the forgetting. 
Meaningfulness does produce large 
differences in rate of learning. How- 
ever, it is quite possible that if ma- 
terials or different degrees of mean- 
ingfulness were equated for associa- 
tive strength at the end of learning, 
no differences in rate of forgetting 
would be demonstrable. It goes 
without saying that if such findings 
do occur we will need to shift our 
ideas when constructing theories of 
forgetting. 
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SUMMARY 


A review of previous methods used 
to study the relationship between 
rate of learning and rate of forgetting 
showed that none was adequate to the 
problem. In no case was it estab- 
lished that associative strength before 
the retention interval was equal for 
Ss learning at different rates. 

A method for equating associative 
strength at the termination of learn- 
ing was described and appiied to data 
obtained in learning and recalling 
paired nonsense syllables. With such 
equality established it was shown 
that no difference in forgetting oc- 
curred over 24 hr.; the recall of slow 
Ss was as good as that of fast Ss. 

The method may be useful for 
studying the influence of other vari- 
ables on retention. 
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Introduction: A Frame of Reference 


E Smith (4) maintains, social 
a ology attained its rapid growth 
€ and transformation during recent 
ier years, then it can be said 
at the study of personality lost its 


momentum during approximately 
this same period. Indeed, the latter 
seems now to have come to another 
node of its epicycloidal career, al- 
though not quite back to its starting 
point. A look at this last cycle will 
give some perspective to the reviews 
that follow. 

The decade preceding the war was 
characterized by an exhilarating drive 
to explore the whys and wherefores of 
behavior. The air was alive with dy- 
namic theories, models," and experi- 
mental thrusts into the nature of 
motivated action. Genotypical soul 
mates like Tolman's "sowbug" and 
Lewin's "baked potato" vied with 
Hullian habs under the watchful eyes 
of mathematicians, physiologists, and 
logical positivists. What quarreling 
existed was intellectually sound and 
was conducted, in the main, under 
Marquis of Queensbury rules of 
scientific inquiry. This was a period 
when evolutions inherent in the de- 
velopment of psychology reached ap- 
ogee. It was then, for example, that 
atomism seemed finally to be un- 
seated by molar interdependence, 
when instruments became less impor- 
tant than behaviors, and when classi- 
fication and empirical law were sup- 
planted by theory. It was then that 
psychology embraced the hypo- 
thetico-deductive method of science 
and began truly to shed its meta- 
physical trappings. 

It is difficult to recognize these 
prewar antecedents in postwar psy- 
chology, particularly in the area of 
personality. There is, of course, no less 
concern with the causes and courses 
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of behavior, but the method of expli- 
cation has undergone radical change. 
A major divergence has become ex- 
plicit. There is, on the one hand, an 
experimental group involved in the 
mechanics of proof, operating under 
improved ground rules of statistical 
design and rigor. On theother hand, 
there has emerged an equally vigor- 
ous clinically oriented discipline that, 
for the most part, operates with dif- 
ferent assumptions and for whom the 
nature of proof basically seems to be 
"'self-evidence.'' 

For rather different reasons, both 
groups appear to be contributing to 
the delinquency of psychology by dis- 
inheriting the scientific bent so la- 
boriously acquired during prewar 
years. The experimentalists' efforts 
are now all too often vitiated by ab- 
sorption in details and specious vari- 
ables. Unlike their predecessors, they 
seem to have narrowed rather than 
broadened the breadth of their con- 
ceptualization and pursuits. There is 
a danger that the hypothetico-deduc- 
tive method will be lost to them for 
lack of a theoretical vehicle to carry 
it. The clinical group presents the 
problem differently. They have not 
forsaken scientific method—princi- 
pally because they have never adopted 
it. Although their explanatory sys- 
tems have rarely lacked breadth,they 
have continuously lacked evidence. 
Where the experimentalist is inclined 
to recast his system on the basis of a 
single contraindication, the clinician 
is apt to “prove” his system by a sin- 
gle bit of affirming evidence. 

This picture of contrast between 
the pre- and postwar psychology of 
personality is painted, of course, 
without qualifying "shades of gray.” 
It is not so much a lament for per- 
sonality psychology as it is a notation 
of the status quo from which we will 
ultimately move. Thus, the current 
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literature contains not only manifest 
evidence of the present position as 
postulated, but hints as well at some 
lines of future development. 

Of the ten books included in this 
review, six come from other than 
American psychologists. The four 
American contributions may be clas- 
sified basically as compendia of what 
we know, or think we know, about 
personality. One is a text (Patty and 
Johnson), one is a compilation of 
theories (Blum), and two are “practi- 
cal guides" for physicians and teach- 
ers respectively (Harrower, Jersild). 

The European contributors afford 
some relief to this uniformity, al- 
though three of these books are also 
compilations (Eysenck [1953], Ver- 
non, Notcutt). The remaining three 
are of a different stripe altogether. 
One is Eysenck's (1952) recapitula- 
tion and development of a factorial 
theory of personality made known to 
Americans in 1947 (2).  Nuttin's 
book is a declared attempt to reinter- 
pret Freudian doctrine in what the 
author calls a "'nonpathological" 
sense. Spinley's volume is a research 
report which differentiates the per- 
ceptions and personalities of individ- 
uals belonging to two English subcul- 
tures. 


Eysenck—Scientific Study of Per 
sonality 


Of all the books considered, this i5 
at once the most provocative and the 
most provoking. It combines the 
virtue of sophisticated deduction an 
the vice of blatant bias. Although 
occasionally it carps and quibbles, the 
unfolding of its theme tantalizes to 
the very end. 

Eysenck sets the stage in the first 
chapter with a review of the essen- 
tials of scientific method, specifically 
emphasizing antinominalism, concep- 
tualization, and testability. Then, 
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considering the developmental stages 
of science, he consigns psychology to 
a bottom rung of this developmental 
ladder. Our need, according to 
Eysenck, is to establish a new and 
valid taxonomy of personality. The 
only method that will enable us to do 
do so now is factor analysis (p. 42). 
But not any factorial method suffices. 
Eschewing, for the noncognitive field, 
the methods of both Thurstone and 
Cattell, Eysenck presents “Criterion 
Analysis" as an alternative proce- 
dure, which purportedly combines the 
effectiveness of factor analysis and 
the hypothetico-deductive method. 

Criterion analysis begins with the 
selection of criterion groups (e.g., nor- 
mal and neurotic groups) and the ad- 
ministration of what might prove to 
be discriminating tests. Tetrachoric 
or biserial correlations with the cri- 
terion groups are obtained (criterion 
correlations) and test intercorrela- 
tions determined. Factor coefficients 
(saturations) are derived from the 
latter, and their proportionality to 
the criterion coefficients is examined. 
The presence of this proportionality 
is taken as evidence that the criterion 
factor (e.g., neuroticism) has been 
isolated and identified. 

In Chapter 4, Eysenck reports the 
first full-scale use of this method. 
Employing an axis rotation which 
maximizes the correlation between 
factor saturation and criterion in- 
dices, he makes an attempt to iden- 
tify the factor of neuroticism and its 
Subcategories. It is here that one be- 
gins to raise questions about the au- 
thor's presentation. 
pi Higin administered a battery of 76 
baste 8 MEE for factorial analysis. The 
diras x d choice is not. made clear. Pre- 
ioc rid ie number of subjects involved does 

mine the choice since in three tests 
ed and in the remainder N=96. Nor 
she * gea seem to depend on the sig- 
of the correlation ratios (pp. 127- 


150). One wonders what principle of elimina- 
tion was used. 

2. The selection of criterion groups is based 
upon psychiatric ratings with which Eysenck 
repeatedly finds fault until a positive attitude 
is useful to his argument. Thus " .. . psy- 
chiatric diagnosis is of doubtful validity and 
low reliability" (p. 33). Later, after a dis- 
cussion of inconsistent inductee rejections on 
neuropsychiatric grounds, he says: "We are 
left with the unreliability of psychiatric as- 
sessment as the most probable cause" (p. 91). 
But, he then maintains, "...as we have 
shown in the preceding section, the validity 
of psychiatric ratings for future breakdown 
has been established; it would seem to follow 
that methods correlating highly with psy- 
chiatric assessment would also show a certain 
degree of validity" (p. 99). Practically all of 
the experiments which bolster his arguments 
for a neuroticism factor depend upon psy- 
chiatric diagnoses or ratings of severity of 
neurotic disorders (e.g., pp. 106 f.). 

3. A similar tendency on the part of the 
author to favor expedient interpretations is 
exemplified by his variable interpretation of 
correlation coefficients. Thus, a correlation of 
—.21 between intelligence and neurotic 
tendencies is considered to be a “very small 
one” (p. 116) and it is interpreted as indicat- 
ing no close relationship (p. 149). Neverthe- 
less, a series of correlations between level of 
aspiration tests and examination scores “gave 
results in the expected direction,” albeit they 
are —.20 and —.11 in one case, and —.15 
and —.10 in another (p. 266). 


There is little doubt that psycholo- 
gists will follow Eysenck’s logical 
reasoning about neuroticism, despite 
the lapses indicated. But the step 
which follows will separate the en- 
vironmentalist sheep from the he- 
reditarian goats. Employing Hol- 
zinger's h? as an estimate of the 
contribution of heredity to total vari- 
ance,  Eysenck concludes that 
neuroticism as a factor is a biological 
unit which is inherited as a whole (p. 
187). The crux of his argument lies 
in the differences between identical 
and fraternal twins on tests which 
Eysenck employs to differentiate 
normals from neurotics. Although 
mean test scores are the same for the 
two types of twins, pairs of identical 
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twins are characterized by higher 
intracorrelations, and the group of 
identicals as a whole is characterized 
by greater variance. 

One possible source of irritation to 
the reader should be mentioned here. 
Eysenck seems to have the happy 
faculty of anticipating objections to 
his theses, and the unhappy one of 
meeting these objections with a com- 
bination of affirmation and partial 
recant. The assumptions underlying 
the use of h? are probably not met, 
he says (p. 189), but they may err in 
the direction of minimizing the true 
contribution of heredity. On the 
same page, however, he points out 
the possibility of obtaining a lower h? 
value in another culture if the en- 
vironmental factors of that culture 
were more pronounced. Presumably 
then, there might be a culture in 
which it could be ‘“‘proved” that the 
total variance attributable to hered- 
ity is negligible. 

Eysenck rejects the notion, attrib- 
uted to Freud, that psychoticism is a 
part of a normal-neurotic-psychotic 
continuum. Although by criterion 
analysis he distinguishes between 
“psychopath” and “‘hysteria-anxiety”’ 
types of neuroticism, he finds little 
difference between manic-depressive 
and schizophrenic types of psychot- 
icism. The reader may be inclined 
to wonder if the basic premises of 
criterion analysis are sound. Quite 
apart from Eysenck’s conclusions, it 
is probably no more difficult to dis- 
tinguish psychotic subtypes than it is 
neurotic subtypes. On a priori 
grounds one would expect psychiatric 
judgments to be at least equally valid 
in making these distinctions. Yet 
having employed the distinctions to 
create criterion groups, we are re- 
quired to believe that the differences 
we "see" are not valid because test 
behaviors of the groups do not satisfy 
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a statistical notion. In this instance, 
at least, we might conclude that cri- 
terion analysis is better used to dis- 
cover what is not known than to 
corroborate what is. 

Eysenck's book is an important one 
because of its generally convincing 
logic and its freshness of approach. 
These virtues may well operate to 
make it a new choice point in psycho- 
logical history. Such importance 
makes it imperative to evaluate this 
factorial method in the light of our 
developmental frame of reference. 
Apart from the specific criticisms 
already made, two major points 
should be emphasized: 


1. Criterion analysis presumably has the 
advantages of being a hypothetico-deductive 
. Careful examination indicates, how- 
ever, that the hypotheses are mainly validated 
internally, ie. within the confines of the 
method itself. (See particularly the first full- 
scale application of the method described in 
Chapter 4.) The few exceptions in Chapter 7 
notwithstanding, it fails to comply with the 
rigorous requirement of independent external 
validations. 

2. Criterion analysis is admittedly ele- 
mentaristic, and its sponsor is openly op- 
posed to organismic interpretations of be- 
havior or personality (pp. 277-285). Citing 
hand-picked experiments, he refutes the 
meaningfulness of gestalt qualities and sub- 
stitutes for them scalar values on orthogon 
dimensions. Nowhere is it clear how this in- 
dex description of personality can help de- 
termine the dynamics—the causes of change— 
of behavior. Nowhere is it clear how meas- 
ured body sway—a reliable index of “neuro- 
ticism"—will predict today’s equanimity and 
tomorrow's moodiness, Eysenck himself cites 
the difficulties inherent in standardizing his 
factor-loaded tests: 

“|, . whether the tester is a pretty young 
girl, a domineering male, or a mature, sym* 
pathetic woman—all these influences may, 
and in some cases do, affect a person's scores. 
They are difficult to control, particularly as 
the same stimulus—a pretty young girl as 
tester, for instance—may mean quite different 
things and arouse quite different emotions, 1" 
subjects differing in age, sex, and marital 
happiness” (p. 294). 

He might well have added here that the same 
stimulus may be perceived differently by the 


subject from day to day or minute to 
te. Not only the test, but the whole 
tical structure is thereby placed in 


jeopard y. 


Eysenck—The Structure of Human 
Personality 


In this latest book, we find neither 
essential changes nor further develop- 
ments of Eysenck's theoretical posi- 
tion. It is useful primarily as a 
Source book for those who would 
examine personality by the criterion 
of factor analysis. As such, it con- 
tains excellent summaries of factorial 
Studies classified under the following 
chapter headings: ‘“‘Analysis of Rat- 
“ings,” "Analysis of Questionnaires 
and Inventories,” ‘Analysis of Objec- 
tive Behavior Tests," “Analysis of 
Physique,” "Analysis of Physiologi- 
cal Measures," “Analysis of Inter- 
ests and Attitudes," ‘Analysis of 
Correlations between Persons,” and 

Analysis of ‘Trait’ Measurements.” 

Each chapter comprises a historical 
Survey and a critical analysis of the 
area under consideration from the 
Standpoint of its experimental worth, 
Conceptualization, and the factor 
“Methods employed. For those who 
have read The Scientific Study of 
Personality, it goes without saying 
that the critique is made through no 
ordinary orthogonal eyes, but 
through maximally rotated Eysenck- 
Yan ones. In this regard, it is regret- 
table that Eysenck does not maintain 
throughout the sense of humor which 
OWS up so well in the Preface: 
_++.it might be wise to follow 
Psychoanalytic practice and lay it 
E" that criticism of factor analysis 

ould be confined to those who had 
E €mselves been factor analysed!" 
ES We are treated, instead, to 
+ Steady display of the argumentative 
Senuity and the perspicacity that 

So characteristic of the author. 


SPECIAL REVIEW 


287 


Although the book is intended in 
part “to bring together in one volume 
some of the major theories of person- 
ality organization . . . " (p. xiii), very 
little use is made of theories or ex- 
perimental devices which do not 
imply either a factorial approach or a 
trait-type interpretation. Such or- 
ganismic or "specificity" studies as 
do appear are employed mainly as 
foils for a dimensional point of view. 

The preliminary anticipation—and 
the concluding assertion—made by 
Eysenck is that an examination of 
the seemingly contradictory data in 
factorial explorations of personality 
actually demonstrates a basic agree- 
ment. The evidence for many differ- 
ent personality dimensionsis traceable 
mainly to semantic disagreements, in- 
adequacies of the factor techniques 
employed, and the differences in aims 
of respective investigators. That 
these differences are peripheral is 
made clear by reference to a hierarchi- 
cal structure of personality organiza- 
tion—from specific responses, through 
habitual response and trait levels, to 
a type level. At this highest organi- 
zational (type) level, he believes that 
three orthogonally related dimen- 
sions are implicitly agreed upon— 
Neuroticism, Extraversion-Introver- 
sion, and Psychoticism. Eysenck 
does not imply that these are the 
only higher-order noncognitive fac- 
tors. In fact, he states: “Nor can it 
be maintained that these are the only 
higher-order factors which may be dis- 
covered; nothing can be said yet 
about the total number of such fac- 
tors required” (p. 319). Yet, curi- 
ously, in discussing second-order fac- 
tors involved in social attitudes, he 
strongly denies the likelihood of their 
being more than the R (conservative- 
radical) and T (tough-tender minded 
or practical-theoretical) factors al- 
ready determined (p. 244). Such 
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assurance seems out of keeping with 
the scientific spirit so readily in- 
voked elsewhere by the author. 

Eysenck's books, considered to- 
gether, constitute an extraordinarily 
well-documented and tightly woven 
thesis. Although in reading them one 
encounters a surfeit of "factors" and 
"dimensions," there is no doubt that 
the internal reasoning is both chal- 
lenging and sound. They suffer 
particularly from a paucity of ‘‘ex- 
ternal” (e.g., anthropological) sup- 
portive material such as distinguished 
Cattell’s (1) factorial approach to 
personality. For those who are fac- 
torially inclined, the system outlined 
may be completely acceptable as a 
pathway to personality. For others, 
its monorail construction may prove 
to be discouraging. Whatever his 
point of view, no psychologist can 
afford to ignore it. 


Vernon—Personality Tests and As- 
sessments 


After the supercharged atmosphere 
engendered by the Eysenck books, 
this slim volume comes as a cool and 
refreshing breeze. It covers almost 
exactly the same ground as Eysenck’s 
(1953) review of personality tests, 
but it does so without restricting the 
material to factorial studies ex- 
clusively. Although the book is only 
200 pages long, it reports succinctly 
on attitude and interest measure- 
ment, ratings and questionnaires, 
physical and behavioral tests, and in- 
terview assessments. In addition, 
there is material on cognitive tests, 
and a chapter on projection (English 
nomenclature for the American “‘pro- 
jective”) techniques. 

Vernon reports and interprets data 
in a calm and studied manner. Be- 
cause of its brief but unhurried di- 
gestion of assessment devices and its 
cogent but dignified criticism, this 
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book is recommended to the students 
and scholars alike who desire an un- 
biased orientation to the field of per- 
sonality testing. It brings no solace 
to those who seek vindication of one 
or another technique, but treats all 
techniques as legitimately adven- 
turous early steps in the learning- 
to-walk period of valid assessment. 
Vernon's book is written from what 
he calls a “trait point of view,” but 
not so rigidly as to make him an idol 
worshiper. He accurately points out 
the difficulty of selecting distinctive 
syndromes of behavior for criterion 
purposes, the unreliability of psy- 
chiatric and associates’ ratings used 
for the same purpose, and the prob- 
able existence of traits which ‘cannot 
be anchored to any syndrome” (p. 
11). Even more important, he recog- 
nizes the artificiality of studying per- 
sonality in isolation from society. 
Vernon’s contentions, that “ ee 
an infinite number of solutions exist 
to any factorial problem" (p. 12) and 
that "no two leading psychologists 
agree as to which are the best di- 
mensions" (p. 12), are in marked op- 
position to Eysenck's position on 
both points. So, too, is his belief that 
Q- and P-techniques are especially 
fruitful. The two authors do, how- 
ever, find common ground in their 
agreement upon the subjectivity and 
unreliability of the “clinical” ap- 
proaches. Eysenck would discar! 
them entirely on this account, but 
Vernon argues for the use of objective 
tests and techniques to back them up 
or correct them. p 
Vernon’s chapters follow a fairly 
consistent pattern of concisely re- 
viewing subtechniques which fa 
under selected rubrics; each includes 
clearly labeled discussion sections 
when the author feels they are war- 
ranted, An example of this pro- 
cedure, well worth reading, is t^e 
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discussion of paper-and-pencil per- 
sonality tests in the chapter, “Self- 
Ratings and Personality Question- 
naires” (pp. 136-140). 

His eight-page concluding chapter, 
"Conclusions and Future Develop- 
ments," is a masterpiece of fulsome 
brevity. It conveys neither the ex- 
tent to which his summaries cut 
cleanly to the heart of the evalua- 
tion, nor the number of research pro- 
grams implicitly designed. Its three 
sections deal with the situations in 
which personality assessments are 
principally required: selection, ex- 
perimentation, and diagnosis or guid- 
ance. The use of personality tests in 
selection procedures is the most suc- 
cessf ul, Vernon feels, because theoret- 
ical problems of personality are at a 
minimum, and external validation 
criteria are readily available. 

From the standpoint of diagnosis, 
guidance, treatment, and control, 
however, the reverse is true. Theoret- 
ical problems are maximal, and 
criteria—both internal and external 
—are generally lacking. For those 
who are familiar with Vernon as a 
protagonist of factorial and trait- 
composite research, his recognition 
that they are cross sectional, and not 
dynamic, may come as some surprise. 
Even more unexpected, perhaps, is 
his hope “that a more fruitful system 
will eventually replace [trait psychol- 
ogy]" (p. 205). 

His final plea is expressed as a 
need for the direct recording of be- 
havior. This, he feels, will reconcile 
trait psychology with field theory, 
With the study of social groups and 
processes, and with longitudinal 
Studies of personality development. 

Nowhere, perhaps, will the Ameri- 
can psychologist be brought face to 
àce with one of his own problems 
more clearly than in this book. His 
English colleagues have observed and 


have often pointed out the limita- 
tions of clinical techniques, such as 
interviews, Rorschachs, and ratings, 
when used as everyday diagnostic 
tools. Although the critical studies 
most often cited have been made in 
this country, they have had less in- 
fluence here than abroad in actually 
changing practice. It appears to be 
difficult for at least two English 
psychologists of repute—Eysenck 
and Vernon—to understand how we 
can, at the same time, be scientifically 
self-critical and clinically self-indul- 
gent. This trait composite has yet to 
be explored by the culture-oriented 
psychologist. 


Spinley—The Deprived and the Privi- 
leged 

Spinley’s book serves to show how 
a good idea can go wrong. The idea 
itself is symptomatic of one probable 
trend that personality psychology 
can be expected to take in the future. 
Two hypotheses imply, roughly, the 
direction of this development: 

1. Groups exist which can be distinguished 
by their material and social environments. 

2. Over and above individual differences, 
there are “personality types” which can be 
associated with, and attributed to, specific 
cultures or subcultures. 


Spinley would add to these a third 
hypothesis, that "there is a basic per- 
sonality type; therefore the groups 
will possess some cultural aspects and 
some personality trends in common" 
(p. 18). Her book, a modification of 
her doctoral dissertation, attempts to 
substantiate these statements. 

The populations chosen for ex- 
perimental verification consisted of 
two English subcultures: one was a 
"slum group," the other a group of 
economically privileged graduates of 
English Public Schools. The author 
unfortunately obtained her data from 
the two groups in quite different 
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ways, Among the slum dwellers she 
played the role of participant ob- 
server in successive age-group clubs 
in a settlement house. In addition, 
she utilized structured interviews 
with informants selected from among 
social workers and mothers. The 
privileged group data consisted of 42 
solicited life histories that were sup- 
plemented by interviews with nurses- 
in-training to establish optimal child- 
rearing practices. Rorschach tests 
were administered to samples of 30 
people from each of the two subcul- 
tures. 

One of the most informative chap- 
ters is that in which data from the 
deprived group are collated. A series 
of 117 items characterizing this 
group are arranged in maturational 
order, and each is coded according to 
its source: statements based on field 
observation and supported by the 
large majority of informants; state- 
ments based on field observation, but 
with the opinion of informants fairly 
evenly divided ; statements based sole- 
ly on observation; and statements 
based solely on data from informants. 
The data from the privileged groups 
are extracted from the life histories 
and hence do not necessarily overlap 
the statements about the deprived 
group. Nevertheless, the author de- 
rives a series of characteristics that 
purportedly typify the personalities 
within the respective groups. In arm- 
chair fashion she then attempts to 
reason out the culture-causal factors 
producing each. Finally, having ad- 
ministered and scored the Rorschachs 
herself, she finds by comparative 
analysis that her data are consistent 
in differentiating privileged from 
deprived personalities as she depicted 
them. 

Spinley's book could well be labeled 
a study in unreliability. Neither the 
instruments nor the judgments are 
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shown to be reliable. Her own re 
liability is not guaranteed, and her 
role introduces the dangers of halo 
effect and unintentional bias. It 
should be noted that Spinley is aware 
of these sources of methodological 
impurity, but pleads the limitations 
of time and resources of the lone in- 
vestigator. 

When objectives rather than meth- 
ods, are considered, it would appear 
that Spinley in some respects 
taken her cue from Vernon in looking 
for socially induced personality syn- 
dromes. She is, of course, not the 
first to do so, nor is her approach es- 
sentially different from the workaday 
methods of sociology and cultural 
anthropology. To the “personality 
psychologist," however, a change in 
objective might be desirable. Per- 
haps this could be an emphasis on 
socially induced perceptual proto- 
types rather than on socially induced 
behavioral prototypes. Better still, 
we might hope to trace the relation- 
ship between the objective social 
situation, the perception of that situ- 
ation, and the behaviors which follow 
from it. In this way, we might crys- 
tallize the functional interplay of the 
many disciplines which study society 
and the person. 


Notcutt—Psychology of Personality 


Among the books reviewed here, 
this is the only one which expressly 
attempts to present the gamut of con- 
temporary theories of personality, 1n- 
cluding the techniques used, the re- 
sults obtained, and the applications 
thereof. The categorization of pèr- 
sonality theories that Notcutt em- 
ploys as a point of departure leads 
one to expect a thorough and uniform 
development of each, somewhat 1n 
the manner of Hilgard's treatment 0 
learning theories. Instead, one finds 
a hodgepodge of historical reflections 


ded with tabulations of basic 
í , personality types, and de- 
e mechanisms. 
In twelve extremely short chapters 
itcutt covers the history, interac- 
and interpretation of seven 
es, two major assessment meth- 
and the logic of validation. The 
age, as might be expected, is 
and thin. The depth of treat- 


Lis said that in some parts of the United 
morticians employ minions to listen in 
the short wave police radio, so that when- 
[ fatal accident is reported, they can 

hurry to the spot and sell a funeral. If psy- 
Ogists had shown more enterprise of this 


the subject might have progressed 
(p. 185). 


In short, Notcutt’s abortive analy- 


erable conflict between psycho- 
lytic theory and certain religious 
efs. Nuttin's book has the dis- 
ction of being among the first to 
al with this problem by reinter- 
g and extending a psycho- 
lytic point of view, rather than 
; ly opposing it. 

Nuttin gives an unprejudiced ac- 
Count of Freudian theory in the first 
apter, but gradually and grace- 
weaves into it the points he is 
fee later in refuting its application 
- normal" behavior dynamics. 
nus, Freud failed to examine seri- 
y the role of the conscious ego; 
dian theory is inadequately bol- 
ed by experimental evidence; 
Choanalytic theory is not insep- 
ly connected with psychoanalytic 
by. Arguing that psychoanalysis 
hot the only psychotherapeutic 
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method, Nuttin points to Rogerian 
nondirective therapy as a desirable 
alternative. This particular choice 
supports the position that such 
therapy “permits the patient to con- 
struct his own personality” (p. 91) 
and that it is based upon a “social” 
rather than a directive relationship. 

Nuttin is now ready to reject un- 
conscious motivation in favor of free- 
dom of will as a major determinant of 
normal behavior. An interesting 
point in his thesis is that very little 
human behavior is not unconscious, 
whether it be of psychic or biological 
origin. What is crucial, he says, is 
not the source of needs, but the felt 
need and its influence. It follows 
from this, he continues, that the psy- 
chology of normal people need not be 
based upon repression but upon the 
law of effect. Thus normal behav- 
ior develops according to success 
achieved and consciously known. 
Undesirable behavior disappears be- 
cause “it does not continue to bring 
satisfaction to the whole personality” 
(p. 188). Nuttin completes his argu- 
ment with the postulation of three 
levels of human activity correlated 
with basic needs: a psychophysiologi- 
cal level, a psychosocial level, and a 
spiritual level. The last level is co- 
ordinated particularly to man’s need 
for self-preservation and self-develop- 
ment. 

Nuttin has evolved a theory of 
normal behavior to fill the gap pre- 
sumably left by those concerned with 
pathology alone. In so doing, how- 
ever, he creates a dichotomy which is 
untenable in psychological theoriz- 
ing. His highly speculative and 
metaphysical reasoning is supported 
only by an occasional clinical inter- 
view. His conclusion is completely 
predictable. Nonetheless, his at- 
tempt to cope with, rather than op- 
pose or proscribe, “controversial” 
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material is a refreshing change. It 
seems likely that Nuttin's book will 
find wide acceptance in those groups 
which need to reconcile psychother- 
apy and religious faith. Despite its 
persuasiveness, it is not likely to gain 
wide acceptance among personality 
theorists. 


Jersild—Jn Search of Self 


This book by Jersild—the first of 
the American authors represented 
here—is a remarkable reiteration of 
Nuttin's premise regarding the be- 
havior of normal people. Nuttin 
argued that people operate in the 
direction of constructive self-realiza- 
tion, and achieve it through the 
operation of the law of effect. Jersild, 
influenced by Lecky (3) and Harry 
Stack Sullivan (5), holds similarly 
that “a person accepts and incorpo- 
rates that which is congenial to the 
self-system already established, but 
he seeks to reject or avoid experi- 
ences or meanings of experiences 
which are uncongenial” (p. 14). 
Unlike Nuttin, however, he believes 
that people may show either self- 
acceptance or self-rejection, the for- 
mer being essential to mental health, 
the latter its unhealthy counterpart. 

In Search of Self reports a fact- 
finding procedure designed to help 
teachers better understand their stu- 
dents and to assist them in achieving 
self-acceptance. It consists of a 
tabulation of items reported by stu- 
dents in written essays on “What I 
Like About Myself" and “What I 
Dislike About Myself." The re- 
sponses of almost 3,000 individuals, 
ranging from fifth-graders to college 
seniors, are reported according to 
grade level, sex, and type of response. 
Unfortunately, statistical informa- 
tion is limited to frequency and per- 
centages of individuals reporting in 
each category. Although the data 
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actually suggest little more than that 
young people may like or dislike 
themselves for almost an infinity of 
reasons, Jersild builds precariously on 
this sparse evidence in his discussion 
of categories of self-acceptance and 
rejection. He could readily have 
made his appropriate and instructive 
remarks without reference to the re- 
search data. His most valid gen- 
eralization, labeled the ‘universal 
language of the self," is the recogni- 
tion that "categories of self-evalua- 
tion at any one age level are also 
prominent at other levels" (p. 30). 

The contribution of Jersild's book 
may not be unique from the stand- 
point of personality theory, but it is 
certainly a timely and well-developed 
contribution to educational theory 
and practice. Part Three of the book 
is devoted to the problem which the 
school and teacher must both meet— 
an adequate setting for the indivi- 
duality of student needs, with par- 
ticular reference to self-acceptance. 
To this point, he says: 

The failures, reminders of limitations, and 
the rejection which children face at school are 
often artificial and forced. They may have 
the effect of humiliating the child by de 
preciating his worth in a manner that does no 
good to society and does him great harm. 
Much of the failure at school is contrived- 
Much of the depreciation children encounter 
there is based upon false evaluation (p. 91). 

With regard to educational “de- 
vices," Jersild points out the un- 
certainty of their results. In group 
discussions, for example, anxiety may 
not be released, but by-passed in 1n- 
tellectual or logical flight. On the 
other hand, individuals may betray 
by their exaggerations, not so much 
their thoughts as their feelings On 
the subject under discussion. 
becomes the job of the teacher to ap- 
preciate the individual effect and to 
shepherd the student's struggle to be 
himself. In so doing, the teacher will 
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himself gain opportunity for pro- 
ductive rather than conforming be- 
havior. In addition, he will be better 
able to cope with his own problems, 
particularly those evoked by con- 
tact with students. 

Finally, Jersild clarifies the posi- 
tion of the teacher as a psychologist. 
He is not, of course, to take on the 
role of the professionally trained psy- 
chologist or psychiatrist. However, 
"every teacher is in his own way a 
psychologist. Everything he does, 
says, or teaches has or could have a 
psychological impact. What he offers 
helps children discover their resources 
and limitations. He is the central 
figure in countless situations which 
can help the learner to realize and 
accept himself, or which may bring 
humiliation, shame, rejection, and 
self-disparagement” (p. 125). 

In summary, Jersild's book has the 
virtue of attempting to translate per- 
sonality theory into an action level. 
Those who operate in the rarefied 
atmosphere of theory for theory's 
sake may find it speculative and 
wandering at times, but will not re- 
ject it. On the other hand, the class- 
room teacher for whom it is surely 
intended and appropriate may find it 
too abstract and theoretical to fol- 
low. Perhaps its optimal place is in 
the hands of those who teach teach- 
ers, as a working guide to a sophisti- 
cated educational philosophy. 


Harrower—A ppraising Personality 


This book was ostensibly written 
to acquaint the medical practitioner 
with the nature of psychological 
testing and its value as an adjunct 
to diagnosis and treatment. More 
obvious is its attempt to establish 
the respectability and validity of cer- 
tain projective techniques, partic- 
ularly in the hands of a trained psy- 
chological interpreter. The devices 
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with which Harrower is concerned are 
the Szondi test, the Miale-Holsopple 
sentence completion test, human- 
figure drawing, Wechsler-Bellevue re- 
sponse "scatter," and the Rorschach 
test. 

The method of presenting these de- 
vices is an unusual one—a not very 
Socratic dialogue between a physician 
and a clinical psychologist. The 
physician is no ordinary general 
practítioner, being much too well 
primed. He asks about the Szondi 
pictures, "What are these strange 
looking faces?" and later, “In what 
way may I be legitimately paranoid? 
What does it show if I like the 
paranoid faces?" The psychologist 
replies with characteristic clarity, 
“You would accept consciously the 
need to be emotionally driven beyond 
yourself, to become involved in or in- 
volved with other things or persons 
and by so doing extend your own 
frontiers and boundaries" (p. 126). 
It is unfortunate that instances of 
this type of dialogue occur all too 
frequently, the psychologist speaking 
with equal assurance on each such 
occasion. lt is possible to summarize 
his confidence (Harrower identifies 
with a male psychologist in this 
book) on the basis of the statement 
“|. by and large, psychological 
tests have the same margin of error 
as do those concerned with somatic 
phenomena." She is referring spe- 
cifically to the 10 per cent of false 
positives on the Wassermann test, 
which was taken as a sample of 
somatic diagnostic techniques. It 
would seem that Harrower's inter- 
pretation of validation studies is at 
great variance with that of Vernon, 
Eysenck, and most American psy- 
chologists. While all might place 
credence in the ability of these de- 
vices to mirror some extremes of per- 
sonality, their present determinable 
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accuracy for an unselected group 
would certainly be less than 90 or 
even 50 per cent. 

Harrower does present one area 
with more restraint, viz., her inter- 
pretation of “scatter” on the Wech- 
sler-Bellevue. Especially good is her 
review of responses to single test 
items, particularly with respect to the 
clinician's opportunity to make judg- 
ments by observing the patient in 
action. It seems quite likely that 
much more of the diagnostic success 
of these techniques is attributable 
to such observation than to the test 
indices and scores themselves. This 
point is corroborated in Harrower's 
(synthetic?) cases reported in testi- 
monial fashion in the last section of 
her book. A number of physician 
referrals are seen by the psychologist, 
who reports back to the physician the 
psychological status of the patients. 
The first of these—under the title 
"Which Twin Has Epilepsy"—pre- 
sents no problem if one ignores the 
test results but reads carefully the 
behavioral description. This re- 
viewer, at least, was able to make an 
educated guess (a correct one), and 
presumes that with prodding he could 
also have told which twin had the 
Toni. 

The next case deals with a “back- 
ward" girl, about whom the physi- 
cian inquires, "Would you give me 
your opinion as to the chances of her 
success in a therapeutic venture?" 
The answer would be obvious to most 
who had determined her verbal IQ 
to be 50. With Harrower, it requires 
the combined virtues of Szondi and 
Rorschach tests to divulge a psy- 
chotic ego picture, and hence con- 
traindicate therapy. 

Psychologists of logical bent know 
that somewhere they must hold the 
line, theoretically speaking. This 
book, although certainly written from 


honest convictions, is apt to desig- 
nate where they will draw the line, 


instead. 4 
Blum—Psychoanalytic Theories of 
Personality 
Blum's avowed goal is to supply a 
framework for psychoanalytic per- 
sonality theory since it is at once the 
"most comprehensive one of its 
species" (p. viii), and has fared the 
best in practical applications. Al- 
though we may question these prem- — - 
ises, it is apparent that the author has 
summarized exceedingly well the y 
many and varied psychoanalytic 
points of view represented. Re- 
grettably, the coverage is more ex- 
tensive than intensive, and its struc- 
ture encourages browsing curiosity 
rather than intellectual involvement. 
Each chapter of the book repre- 
sents a segment of the chronological 1 


sequence of personality formation, 
psychoanalytically conceived. It 
K 


thus begins with “Prenatal and Birth 
Influences," includes the ‘‘Latency 
Period,” and ends prematurely with 
“Adult Character Structure.” With- 

in each chapter, subsections are de- 
voted to ego and superego formation, 
psychosexual development, relations. 

with others, and mechanisms. The 
subsections are in turn subsected (an 
induced neologism) to note the posi- | 
tions taken by  Freudians, neo- 
Freudians, and other psychoanalytic 
theorists. Each chapter ends with a 
summary and author's notes; the 
latter consist of reports of experi- | 
mental research presumed to be re- ' 
lated to the chapter material, sug- | 
gestions for further research, and the 
author's critical comments. 

The organizational structure of the — 
book seems to defeat its purpose ina 
number of ways. The persons for 
whom the book was written—gradu- 
ate and undergraduate students in 
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psychology and related disciplines— 
will not enjoy reading it through be- 
cause of the difficulties of assimila- 
tion inherent in its discontinuity. If 
they attempt to trace a single concept 
such as “libido-regression,” they will 
be forced to turn repeatedly to the 
index and to the respective sections 
where the term occurs. To compare 
one such concept with another makes 
the process doubly inconvenient. 

For research workers, the structure 
is equally hampering. Blum, in his 
notes on each chapter, summarizes a 
variety of recent psychoanalytic re- 
search materials. The summaries, 
however, are so embedded in other 
material and so far removed from the 
theoretical content to which they 
refer that they appear to be super- 
ficial and unnecessary. The major 
accommodation for the research 
Worker is actually the 295-item bib- 
liography. 


Patty and Johnson—Personality and 
Adjustment 


Any textbook on personality must 
comply with unusual demands. It 
Must not be too lofty, erudite, or 
theoretical—nor too elementary, 
Vague, or contradictory. It must 
give the essentials of theories without 
vitiating them, and it must do so in 
readable fashion. It must, above 
all, enable the student to see the re- 
latedness of theories to the people 
from whom ultimately they are 
derived and to whom they will apply. 

Patty and Johnson fall far short of 
Roe utopian goals and some lesser 
EL At first glance, the Table of 
s ntents gives promise of a well- 

Juctured approach to mental hy- 
Es using personality theory as a 
"hm e. But the development of the 
E to this end is certain to be 
2 nem or misleading to the 

"Cent. An example of this comes 
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early in the chapter on motivation, 
where Lewin's topology is intro- 
duced: 

This viewpoint made extensive use of field 
theories, by means of charts showing arrows 
(vectors) symbolizing "behavior tendencies" 
or "pressures." ... It stresses the fact that 
animal behavior seems to maintain a sense of 
direction in a complex situation. Its weak- 
ness lies in the difficulty, if not the impos- 
sibility, of describing behavior in sufficient 
ree by means of the available symbols (pp. 
49 ff.). 


It is a rare student who will derive 
from this any concept of Lewin's dy- 
namics, or do more than reject them. 
It would be far better to have left it 
unsaid. 

The frequency of misconceptions 
and misdirections is so great as to 
make recasting unbelievably difficult 
for the instructor and impossible for 
the student. The chapter on percep- 
tion, a keystone in the author’s de- 
velopment of behavior determinants, 
consists of exactly two topics: illu- 
sions and “‘set,” and projective tech- 
niques. Chapter 7 contains this un- 
verifiable datum: 

It appears that mental ill-health in the ex- 
pectant mother affects the development of 
the child in ways which may appear later as 
anemia, nervousness, underweight, and other 
equally general conditions (p. 160). 


There is little use in multiplying 
these instances, which are primarily 
supplied to let the reader make his 
choice of text both forewarned and 
forearmed. 


Epilogue 

In these last remarks, I should like 
to claim the privilege of forsaking 
the editorial third-person and give 
my pen a reprieve from conven- 
tional expression. 

The lot of the reviewer, like that of 
the policeman, is not a happy one. 
When he wants to say, belligerently, 
that the author tried to stuff a 
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rhinoceros down a mouse hole, he 
feels restrained by the knowledge 
that those who criticize reviewers are 
rougher than those who criticize 
books. If he attempts a bit of 
whimsy, he may be accused of being 
precious or puckish. If he's for some- 
thing, he's really against someone. 
It is probably thus that reviewers 
develop tough hides that cover once 
tender hearts. 

In looking back on these reviews, 
I am struck by their general nega- 
tivity. I do not think that this comes 
from looking too long through jaun- 
diced eyes. I am inclined to believe 
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that for books on personality the 
year has indeed been fallow and lack- 
ing in conceptual breadth. But 
Pollyanna-like, I feel that it repre- 
sents only a breathing spell, a period 
in which we consolidate our gains and 
prepare to strike out in new direc- 
tions. It was for this reason that 
I characterized our psychological 
course as epicycloidal, and our posi- 
tion as nodal. It is fortunate that the 
analogy stops here—for the epicycle 
comes back to its starting point, 
while we may, with scientific out- 
look, hope to approximate true 
rather than apparent motion. 
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Buros, Oscar KRISEN. (Ed.) The 
fourth mental measurements year- 
book. Highland Park, N. J.: 
Gryphon Press, 1953. Pp. xxiv 
+1163. $18.00. 

Oscar Krisen Buros’ The Fourth 
Mental Measurements Yearbook is the 
seventh in a series of publications 
intended to make available to prac- 
titioners in psychology, education, 
and related disciplines the rich re- 
sources of tests and measurements. 
The series began modestly with the 
little pamphlet “Educational, Psy- 
chological, and Personality Tests of 
1933 and 1934” (issued in 1935), fol- 
lowed by a cumulated monograph 
“Educational, Psychological and Per- 
sonality Tests of 1933, 1934 and 
1935” (issued in 1936). A year later, 
Buros published “Educational, Psy- 
chological and Personality Tests of 
1936: Including a Bibliography and 
Book Review Digest of Measure- 
ment Books and Monographs of 
1933-36” which, however, abandoned 
the notion of cumulating the test 
entries. In 1938 was published, in 
book form, The Nineteen Thirty 
Eight Mental Measurements Yearbook 
of the School of Education, Rutgers 
Miereity, followed in 1941, by The 
pen Forty Mental Measurements 
E and in 1949 by The Third 

ental Measurements Yearbook. 

The current volume is a beautifully 
Printed and well-organized tome of 
oval than 1,100 pages. In no way 
lied it be compared with the first 

oH pamphlet. The earliest three 

E lications were primarily test bib- 

E apice. The shape of things to 
me was first evident in the 1938 

Publication. For in that volume, 
uros undertook the prodigious tasks 


of getting cooperative reviews of 
tests (issued as separates in the 
English-speaking countries) and of 
compiling significant excerpts from 
published reviews of books and 
monographs related to tests and 
measurements. In 1940, he made 
this task his by tradition; in the 
present volume, he has dignified the 
tradition by an opulently produced 
book. 

The 1953 volume is intended to be 
a bibliography as well as a review of 
tests published in the period 1948- 
1951. But Buros has continued the 
practice of adding to the bibliography 
previously unlisted items and, when- 
ever possible, reviews of such tests 
published since the series began in 
1933; the scope is considerably 
greater in the current volume, which 
lists, for the first time, restricted 
tests together with reviews of a few 
of them. Indeed, Buros, as well as the 
test organizations, is to be congratu- 
lated for removing some of the so- 
called secrecy from the instruments 
utilized by agencies like the College 
Entrance Examination Board, the 
Educational Testing Service, and the 
National League for Nursing Educa- 
tion. Further, the volume continues 
the practice of listing and giving ex- 
cerpts from books and monographs 
related to tests and measurements 
for the period covered, 1948-1951. 

In a sense, Buros’ publications re- 
present a compromise. In the early 
thirties he intended to develop a test- 
consumers’ research organization. 
Failing to realize such an accom- 
plishment, he devoted himself to the 
lesser, but very important, goal of 
supplying to all, particularly to test 
consumers, “frankly critical re- 
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views." Such a limitation, of course, 
meant that he had to become de- 
pendent on outside reviewers rather 
than upon members of his own re- 
search staff. During the years, Buros 
has had the cooperation of the lead- 
ing specialists in test construction 
and test utilization. Nor has Buros 
left the task to them alone; he has 
supplied the bibliography of pub- 
lished research related to each listed 
test. Such cumulation of related 
material has been, and should con- 
tinue to be, a valuable resource for 
the student and the scholar. For ex- 
ample, the cross-references for the 
Wechsler-Bellevue Intelligence Scale 
now goes to 312; and for Rorschach to 
1,219. As for books about measure- 
ment, the review excerpts that he has 
edited and compiled make another 
basic contribution toward appreciat- 
ing the growing sophistication of test 
constructors. 

An editor who conceives his func- 
tion to be the improvement of test 
making and hence the improvement 
of test using deserves well of all who 
make and use tests. Indeed it is fit- 
ting that the Division on Evaluation 
and Measurement of the American 
Psychological Association and the 
Psychometric Society jointly ex- 
pressed their appreciation to Buros 
for his enterprise over the past 20 
years. The fact that these two or- 
ganizations memorialized Buros indi- 
cates the value and significance of the 
work. Indeed, the volume's defi- 
ciencies seem entirely attributable to 
Buros' limited financial resources 
rather than to any administrative or 
intellectual constraints. The volume, 
in so far as it is not cumulative, 
makes it necessary for the interested 
: user to manipulate several volumes, 
for test reviews are to be found in the 
four volumes for 1938, 1940, 1949, 
and 1953. Perhaps the time has now 
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come when test listing and test 
viewing can be done best through t 
medium of an official magazine s 
sored by the American Psych 
ical Association. Such a public: 
would give greater currency a 
newer tests faster. With the expan 
ing test field, it seems too long to 
have to wait four or five years for 
Buros’ next volume. Indeed, if sub- — 
scribers are willing to pay $18 every _ 
four or five years, there may be even 
more who can afford $4 or $5 every 
year for more current material. — 

Buros, in his attempt to be fair to — 
every test maker, has “leaned over 
backwards.” At one time, he for 
warded the critical reviews to the test” 
authors and/or publishers to make - 
certain that the critiques were Te 
lated to the facts. This apparently 
was so formidable a task that heaban- ^ 
doned it in favor of trying to achieve 
at least three reviews of each test by —— 
different analysts. Fair as that may - 
be, it seems not nearly as valuablean 
objective as getting at least ome Te 
view of every listed test. In a random 
sample of 20 entries, four were not 
tests at all (a manual on aphasia, à 
book on Rorschach, a manual for 
driver selection, a series of forms for 
employee selection) ;seven were of new k 
tests not previously listed, but which ~ 
remained without review; three were { 
for new tests not previously liste? 
with a review; four were for re ) 
of a revised or supplemented test; 
two were reviews for tests which i | 
been previously listed and reviewed. 
Perhaps editorial energies could have 
been expended to get all the new en- 
tries reviewed at least once. T 

It is not difficult, however, to rec- — t 
ognize that the reviewers themselves , 
frequently add their personal preju- — | 
dices to their test appraisals. Some 
times these biases are minor Ones, 
such as criticizing the direction’: ' 
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sometimes they are major ones, such 
as questioning the use of primary bat- 
teries of standardized achievement 
tests. For one reasonably familiar 
with the proclivities of the individual 
reviewers, it is quite a confirming re- 
action to be able to designate the re- 
viewer just from the review. In a 
field as diverse as testing, various 
specialists have developed special 
attitudes: e.g., criticism of objective 
scoring as restricting the scope of 
testing, or the impossibility of getting 
truly representative national norms, 
or the overclaims on diagnostic pro- 
files. As is inevitable, the reviews are 
uneven because of the very speciali- 
zation of the reviewers. In one sense, 
the user of the volume should have 
available specifications of “all the 
players." 

: Nevertheless, most of the advice is 
significant and cogent. A few quotes 
give the flavor of the volume: the 
test has “little to recommend it . . .in 
preference to many similar commer- 
cial tests on the market," or “the 
items in the test also have a wide 
range of difficulty, the earlier items 
being very easy; but the last, even 
with the answer key available, gave 
the reviewer considerable difficulty,” 
or "among the major ones are: insuffi- 
cient data about what it predicts, 
temptation to do elaborate pattern 
analyses on subjects' scores, some 
urban-rural inequity in the standardi- 
zation sample...” or ‘‘... strong 
relation to the . . . [somebody else's] 
reading tests in form, vocabulary, and 
El testing procedure; and the 
i grade tests could well be addi- 
Enel forms of the [somebody else's] 
3 ests,” or “study of the items reveals 

onsiderable dependence upon recall 
of factual information. ..." Future 

Ep of the volume will profit 
Er attempts to classify the relevant 

ormation by validity, reliability, 
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sampling for normative data, char- 
acter of norms, relevance for specified 
objectives of appraisal or of instruc- 
tion. 

It is the lack of a comprehensive 
plan for coverage that makes for the 
primary difficulty with the reviews: 
they tend to be so prolix and discur- 
sive as to omit much information that 
a consumer should have for his de- 
cisions. Certainly, the bulk of the 
reviews fails significantly to meet 
Buros’ requisite: "Reviews should be 
written primarily for therank and file 
of test users." Too often, the reviews 
seem to be prepared for the reviewer's 
peers. 

The current reviews, however, 
occasionally give evidence that ear- 
lier reviews have led the authors to 
make improvements in subsequent 
revisions of their tests; unfortunately, 
in too many instances, the reviews 
give contrary evidence that the test 
authors were immune to such analy- 
ses. Over the 20 years, however, 
Buros has raised the standards for 
the technical aspects of test making. 
Itis hoped that a magazine may con- 
tinue these important reviews to 
make them more up to date and 
available at a price that will allow 
the rank and file to use them more 
freely. I know of no one as capable 
as Buros to be its editor. 

IRVING LORGE 

Institute of Psychological Research 

Teachers College, Columbia 
University 


MowRER, O. H., and 21 Contributors. 
Psychotherapy, theory and research. 
New York: Ronald Press, 1952. 
Pp. xviii4-700. $10.00. 


This is an important book, but, by 
virtue of the range of content and 
number of authors represented, it is à 
difficult one to review. Part I, en- 
titled “Theory, Concepts and Appli- 
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cations," consists of eight chapters, 
two by Mowrer and six by other con- 
tributors. Part II is entitled “Re- 
search Methods and Results" and 
consists of 11 chapters. Of these, 
Mowrer is the author of two and co- 
author of three more; the remaining 
six chapters represent the work of 
other contributors. Mowrer also pro- 
vides a brief Preface and Introduc- 
tion; there is, however, no attempt to 
summarize or integrate the many di- 
verse contributions, which vary 
widely in both content and quality. 
All but two of the 19 chapters were 
written expressly for the present vol- 
ume. Finally, the book is big: the 700 
pages average about 500 words per 
page and include 21 tables, 120 fig- 
ures, 211 footnotes, and a bibliog- 
raphy of over 300 items. 

In his Preface, Mowrer states that 
"mental disorder and the challenge of 
its more effective understanding, 
treatment and prevention constitute 
one of the major preoccupations of 
our contemporary civilization." He 
observes, however, that only since 
World War II have psychologists 
concerned themselves centrally with 
psychopathology and psychotherapy. 
This volume, written exclusively by 
psychologists who for the most part 
are experienced therapists as well as 
active investigators, is presented as a 
partial report of progress regarding 
developments on this research fron- 
tier. Because of this general limita- 
tion to authors engaged in actual 
research on psychotherapy, the vol- 
ume is in no sense representative of 
either the theory or practice of 
present-day psychotherapy. It does, 
in the 500 pages of Part II, include a 
good sampling of the research meth- 
ods used and the preliminary results 
obtained by psychologists doing re- 
search in psychotherapy. 

Part I begins with a delightful 
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chapter by Rollo May tracing the 
historical development of social atti- 
tudes toward neurotic manifestations 
and the modes of treatment provided 
at different historical periods. This 
is followed by an admittedly specula- 
tive chapter by Carl Rogers in which 
he emphasizes the importance of 
examining the process of psycho- 
therapy entirely apart from its utility, 
and after conceding “that the process, 
directions, and end points of therapy 
may differ in different therapeutic 
orientations," attempts to delineate 
the major common aspects of the ex- 
periences of clients undergoing non- 
directive therapy. In the third chap- 
ter, Mowrer undertakes to develop 
briefly a systematic :ccount of both 
“neurosis and psychotherapy as inter- 
personal processes." In this paper, 
Mowrer continues his argument with 
more orthodox psychoanalytic theo- 
rists on the question of the nature of 
repression and its implications for 
therapeutic strategy in handling 
transference. 

Chapter 4 by William Perry and 
Stanley Estes represents an attempt 
to resolve the directive-nondirective 
controversy by conceptualizing the 
counseling process as one requiring 
“The Collaboration of Client and 
Counselor"; the counselor plays two 
distinctly different roles in two phases 
of the counseling process. In the 
first, he should be truly nondirective, 
but after the client accepts respons! 
bility for his own active participation 
in solving his problems, the counselor 
may appropriately change his role to 
that of an expert collaborator in the 
actual process of problem solving: 
Preliminary research findings are 
presented to support the validity of 
this orientation. 

Chapters 5 and 6 are both analyses 
of psychotherapy as a learning 
process. In the former E. J. Shoben 
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suggests that in therapy the patient 
learns a new "comfort reaction" to 
replace his anxiety reaction and that 
these new reactions generalize suffi- 
ciently to extraclinical situations to 
permit the patient to develop new 
and more appropriate instrumental 
social acts. In the latter, Mowrer de- 
fends his modifications of Freudian 
theory on the basis of his two-factor 
theory of learning. 

Chapter 7 is a somewhat modified 
version of Milton Wexler's previously 
published interesting account of his 
two years of therapy with a severely 
disturbed schizophrenic patient and 
a statement of the modifications in 
psychoanalytic treatment which he 
believes necessary for successful ther- 
apy with such patients. In general, 
his suggestions for revision of theory 
and practice seem congruent with 
those proposed by Mowrer for the 
treatment of neurotics, which perhaps 
accounts for the inclusion of this 
Paper on schizophrenia in a book 
otherwise dealing exclusively with 
neurotic rather than psychotic be- 
havior. 

Chapter 8 is a discussion of the im- 
plications of psychotherapy for public 
ea by Joseph Bobbitt and John 
Ben, with especial emphasis on 
the parallels between psychotherapy, 
group activities, and other education- 
E. tnde Which show some promise 
ui ing effective both therapeutically 
"m prophylactically in the mental 

ealth field. 

Looking next at Part II, we find 

iA a brief but useful summary by 
of m Seeman and Nathaniel Raskin 
ys. research methods and results 
och. out of "client-centered re- 
E uta therapy." Chapter 10 isa 
ia o Dollard and Mowrer's 1947 
chet describing the Discomfort- 
is et quotient as a method of meas- 

ng tension in written documents; 
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this serves as an introduction to 
Chapter 11 (Mowrer, J. McV. Hunt, 
and L. S. Kogan) summarizing fur- 
ther studies utilizing the DRQ; it ap- 
pears to be significantly related to 
therapists' estimates of therapeutic 
success and severalobjective measures 
of personal change during therapy. 
In Chapter 12, Fred Fiedler presents 
an integrated summary of his studies 
on the attitudes and feelings of thera- 
pists of different theoretical persua- 
sions and the role of these feelings in 
the therapeutic relationship. 

The next four chapters deal with 
the application of relatively new sta- 
tistical procedures to clinical re- 
search. Chapter 14, written by 
Mowrer, is entitled *Q-Technique— 
Description, History and Critique." 
In this reviewer's opinion, this schol- 
arly 60-page methodological mono- 
graph is the most valuable contribu- 
tion of the entire volume. In it, 
Mowrer begins by illustrating recent 
applications of the technique in re- 
search on psychotherapy, traces the 
history of the technique and its rela- 
tion to conventional factor analysis 
and other alternative research de- 
signs, and makes a clear-cut distinc- 
tion between Q technique as a method 
of data collection (which he proposes 
to call S technique) and as a method 
of data analysis. Although agreeing 
that these "statistics of the individ- 
ual” hold much promise for research 
on psychotherapy, he also reminds us 
of their limitations and pitfalls, unless 
used in the context of the alternate 
analytic procedures suggested by Cat- 
tell's “covariation chart" and by fac- 
torial design, in which case there 
seems to be real hope of integrating 
"experimental" and “clinical” types 
of research. 

In Chapter 14, Lee Cronbach con- 
tributes a briefer and more general 
discussion of ‘Correlations between 
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Persons as a Research Tool” and its 
relations to other methods of analyz- 
ing complex data. Chapter 15, by 
Lester Luborsky, describes the “P” 
techniques (correlation of repetitive 
measures of the same person) and 
demonstrates its use with a patient 
in the course of psychotherapy. In 
Chapter 16, L. L. McQuitty describes 
still additional methods of evaluating 
“Personality Integration” and sug- 
gests their potential utility in clinical 
research. 

Chapter 17 is another 80-page 
“monograph” by Mowrer. In this he 
first develops the thesis that neurosis 
may usefully be regarded as a ''dis- 
turbance in communication" and that 
psychotherapy is essentially a matter 
of assisting the patient in re-establish- 
ing good channels of communication 
between different aspects of himself 
and with others. He then reviews an 
impressive series of empirical studies 
relative to language, psychopathol- 
ogy, and psychotherapy and suggests 
the use of several procedures (e.g., 
Osgood's "semantic differential") as 
promising methods for research on 
therapeutic change. 

Chapter 18 is an even larger chapter 

"—nearly 100 pages—entitled ‘‘Ten- 
sion Changes during Therapy with 
Special Reference to Resistance," by 
Mowrer and three of his students 
(B. H. Light, Zella Luria, and Mar- 
jorie Zeleny). In the theoretical part 
of the chapter, the authors begin by 
reviewing the Freudian formulations 
regarding resistance and attempt a 
resolution of certain alleged "'para- 
doxes." They next review and eval- 
uate Rogers' views on resistance 
and conclude that the phenomenon 
is of sufficient importance both 
theoretically and practically to jus- 
tify the development of methods per- 
mitting its objective measurement 
during the course of therapy. The 
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major portion of the chapter is 
voted to reporting the developm 
of a simple and practical tech 
measuring “palmar sweating" 
enough preliminary data to ind 
its promise as an index of tensi 
hence, perhaps, of resistance. 
final brief chapter, written by E 
O' Kelly, is devoted to a survey of the — 
possibilities and limitations of other — 
physiological measures as indices of - 
change in therapy. EU 

This then is the content of the 
book. It reflects a tremendous a 
and variety of activity on the partot i 
agreat many psychologists. And, al- 
though the volume is heavily - 
weighted with the thinking and To 
search methods of Mowrer and his 
colleagues, a reasonable effort was 
made to include at least samples 
research by persons with other orien- 
tations. This together with the rela — , 
tively undeveloped and unorganized 
state of knowledge in the field resulted 
in a volume perhaps better characters: 
ized as a “Contributions to..." Of | 
"Readings in..." than a "text — 
book" or “treatise” as its title tends 
to imply. Certain of the chapters 
might, in this reviewer's judgment, 
have been better published as sepa- 
rates since they bear but little relation 
to the major theme of the volume. 
An integrative summary chapter 
would have been helpful, but perhaps 
unnecessary, to emphasize the fact 
that we are not yet ready for such an 
accounting. 

The major criticisms that may be 
made of this volume are those which — 
are inherent in the present undevel- 
oped state of an art which may some- 
time become an applied science. One , 
finishes the book with such questions 
as: ‘‘What is psychotherapy anyhow? N 
“Ts it effective—and if so under what 
conditions and with what kinds of , 
clients or patients?" Even when re j 


art o 


"T 


BOOK REVIEWS 


search results are presented, the read- 
er has difficulty in evaluating them 
' becattse our present state of knowledge 
does not permit an adequate descrip- 
tion of either the nature or intensity 
of the presenting problem. Further- 
more, in view of the extremely wide 
vafiety of activities subsumed under 
the rubric "psychotherapy," the 
reader is at a loss (unless he can read 
an entire series of transcripts) to 
know what really occurred in the 
treatment hours. Finally, as this book 
so well shows, the problems of de- 
veloping criteria are most complex. 
For the most part, the authors of 
these contributions are fully cog- 
nizant of these major problems but 
confident that they are amenable to 
research. 

Whether or not the reader agrees 
with Mowrer's theoretical orienta- 
tion, he will find this a challenging 
and useful series of contributions to 
an increasingly important topic in 
psychology. It is a book which no 
one planning any kind of research on 
any aspect of psychotherapy dares 
to overlook. I predict that but few 
persons will read all the chapters in 
sequence, but that equally few will 
fail to find several of the chapters ex- 
tremely valuable. 

, E. LOWELL KELLY 

University of Michigan 


Lacey, OurveR L. Statistical methods 
in experimentation. New York: 
lan. 1953. Pp. xi+249. 


Linngutst, E. F. Design and analysis 
of experiments in psychology and 
ea New York: Houghton 

ifflin, 1953. Pp. xix-+393. $6.50. 


WALKER, H. M., & Lev, J. Statistical 
Berner. New York: Holt, 1953. 
P. XId-510. $6.25. 


Depending on one's point of view, 
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it may be either distressing or satisfy- 
ing to note the increasing extent to 
which training in statistical methods 
has invaded the curricula of psycholo- 
gists in recent years. Little more 
than a decade ago the typical Ph.D. 
in psychology was generally consid- 
ered to have a respectable statistical 
repertoire if in addition to histograms 
and polygons he had acquired an 
assortment of measures of central 
tendency and variability, manipula- 
tions of the normal curve, linear cor- 
relation and regression, and the criti- 
cal ratio. Sophisticated indeed were 
students who had mastered the intri- 
cacies of the 1 test, chi square, and 
partial-multiple correlations. With 
due respect to Truman Kelley, the 
beginning of the modern era, charac- 
terized by the continuous translation 
of newer statistical logic and proce- 
dures for the consumption of mathe- 
matically naive but eager psycholo- 
gists, can probably be dated to 1940, 
when Lindquist's Statistical Analysis 
in Educational Research and Peters 
and Van Voorhis’ Statistical Proce- 
dures and Their Mathematical Bases 
made their appearance. 

Since 1940, and especially in the 
last five years, a number of "ad- 
vanced” statistical texts designed for 
psychologists have been published, 
and there has been a remarkable in- 
crease in the number of statistical 
articles and reports of complexly de- 
signed research studies. Also, the 
number of statistics courses offered in 
many graduate schools has shown a 
concomitant rise. In some schools 
emphasis on statistics has become so 
strong that students are urged toen- 
roll in courses in mathematical statis- 
tics and spend summers at statistical 
institutes. 

Thus, today the research psycholo- 
gist is expected to possess a compe- 
tence in statistical design, analysis, 
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and inference which can be produced 
only by devoting a substantial por- 
tion of the graduate program to 
courses in statistics or by motivating 
students to do a considerable amount 
of self-study. Perhaps unfortunately, 
the increased emphasis on statistics 
seems in part to have been accom- 
panied by a decreased emphasis on 
psychophysics and measurement the- 
ory. If psychologists should also be 
competent applied statisticians—and 
this is debatable in view of the trend 
toward teamwork in research—it 
would seem that a partial solution to 
the problem lies in selling the idea 
that students should acquire more 
relevant mathematics at the under- 
graduate level rather than simply in- 
creasing the statistical load on gradu- 
ate students. 

Lacey's book represents an attempt 
to bring together the basic concepts 
and techniques of modern statistical 
inference in a form suitable for the 
first course in statistics. The main 
trouble with this excellent little book 
is that it is so completely introduc- 
tory. Beyond a compact discussion 
of the rationale of experimental de- 
sign and inference, the specific topics 
touched upon are the binomial ex- 
pansion, the normal curve, the / test, 
chi square, simple correlation and re- 
gression, and the elements of fiducial 
limits. It is hard to judge where this 
book might fit into the program for 
the training of psychologists. The 
material appears to be too scanty and 
pitched too low for a full-fledged 
graduate course. However, at the 
undergraduate level one might ven- 
ture to replace the usual semester in 
reductive and descriptive statistics 
with a course focusing immediately on 
statistical inference. 

Perhaps the main utility of Lacey's 
book will be to serve as a reference 
text for courses in experimental psy- 
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chology. In view of the current status 
of psychological research design, one 
can seriously question Lacey's claim 
that “when you complete the text, 
you should be able to read and under- 
stand over 80 per cent of the experi- 
mental articles in your field, and 
should also be capable of designing 
and executing many of the experi- 
ments which you yourself may wish 
to try" (p. viii). 

Lindquist’s book furnishes an 
amazingly detailed account of both 
the theory—reduced to algebra—and 
the principal applications of analysis 
of variance and covariance in educa- 
tional-psychological research. Few 
psychologists will complain of a pau- 
city of material in this book, which 
attempts to do for psychological ex- 
perimentation what Cochran and 
Cox’s Experimental Designs has done 
for biology and agriculture. Closely 
written but at the same time deliber- 
ately repetitive in the interest of good 
pedagogy, loaded with algebraic 
proofs and discussion of assumptions 
and pitfalls, illustrated with an abun- 
dance of actual experimental situa- 
tions in both example and exercise, this 
book should make a major contribu- 
tion to the quality of psychological 
research. There is easily sufficient 
material for a year’s course in experi" 
mental design and analysis. Equally 
important, this will be a valued hand- 
book for anyone engaged in psycho- 
logical experimentation. ; 

Although the entire book is unique 
in view of its exhaustive treatment 0 
a multiplicity of useful designs, °S- 
pecially noteworthy are the chapters 
devoted to groups-within-treatments 
designs, controlling individual differ- 
ences in factorial experiments through 
the use of “mixed” designs, and tests 
concerned with trend. The material 
on interpretation of interactions an 
the selection of valid error terms 1 
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developed with outstanding skill. 

This comprehensive and scholarly 
book would have been even better if 
a more adequate job had been done 
in referring to the relevant literature. 
The book is so self-contained that it 
tends to become dogmatic about a 
number of controversial issues. No 
book of this type should fail to con- 
vey the important concept that it is, 
after all, only a selective progress re- 
port on the never-ending search for 
mathematical models that approxi- 
mate the relationships of observed 
data. This reader would gladly have 
sacrificed a number of the proofs in 
favor of a discussion of the problems 
encountered in dealing with non- 
orthogonal data. Although the sec- 
tion on Type II errors was admirably 
presented, it was a surprise to read 
the statement saying that the basis 
for testing Type II errors has been 
worked out only for simple random- 
ized experiments involving two treat- 
ments (p. 71). 

It was real joy to note the appear- 
ance of the long-awaited book by 
Walker and Lev. This book is a 
major contribution to the statistical 
training of psychologists. Just as 
Lindquist's book is unique in its 
ment of analysis of variance, 
\ alker and Lev have done a unique 
job jin surveying the total field of 
Statistical techniques useful to psy- 
chologists, In contrast to Lind- 
Quist'S decision, the authors in this 
Case make no attempt to supply 
ae for procedures, but content 
T nore with ample discussion of 

he d of underlying assumptions. 
Geant iographies at the end of each 
al are excellent, and the text 
Eon lut provide strong motiva- 
tited. ollow-up of the literature 
pie again is sufficient material for 

‘ast a full year's graduate course 
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beyond the introductory level. The 
combination of Walker and Lev with 
Lindquist might conceivably be com- 
pressed into a minimum of three se- 
mesters. Interestingly, Walker and 
Lev provide considerable material in 
the area of analysis of variance and 
covariance which is not presented by 
Lindquist. 

Mere citation of the headings of the 
18 chapters in Walker and Lev's 
book would, with the exception of the 
final chapter on '"Non-Parametric 
Methods" written by Lincoln Moses, 
give little hint of its comprehensive 
coverage. Traditional material is 
presented in the context of the most 
modern of statistical logic, but in 
addition every chapter contains the 
bonus of an excellent selection of re- 
cently developed techniques for deal- 
ing with common problems that 
arise in the analysis and interpreta- 
tion of experimental results. An ex- 
tremely useful set of 24 tables anda 
glossary of symbols are appended. 

In order to fulfill his function as 
critic, the reviewer feels obliged to 
make at least one criticism of Walker 
and Lev's book. In the section deal- 
ing with the comparison of two pro- 
portions based on the same individu- 
als (p. 102), it would have been most 
appropriate if the reference had in- 
cluded McNemar's 1947 article as 
well as the article by Cochran pub- 
lished in 1950. 

It cannot be stressed too strongly 
that the texts by Lindquist and by 
Walker and Lev are by far the best 
ones available in their respective 
areas of coverage. Both are based on 
the assumption that a statistical text 
for graduate students should not be 
a mere cookbook devised for spoon- 
feeding. Obviously these authors be- 
lieve that research workers in psy- 
chology and education should be 
competent applied statisticians. The 
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reviewer concurs and hopes that 
teachers of statistics will have the 
courage to adopt these two books 
widely so that potential research will 
benefit accordingly. 
LEONARD S. KOGAN 
Institute of Welfare Research, 
Community Service Society of 


New York 
Lenman, Harvey C. Age and 
achievement. (Published for the 


American Philosophical Society.) 
Princeton: Princeton Univer. 
Press, 1953. Pp. xi+359. $7.50. 


Lehman's topic is one of urgent 
concern to psychology and to society 
in general. Since age is an inescapa- 
ble attribute of every person, it has 
been an ever-present vector in deter- 
mining human behavior. In view of 
the fact that human longevity has 
greatly increased in the recent past 
and appears likely to increase further 
in the near future, better knowledge 
of the role of age has become progres- 
sively more critical. 

Age and Achievement assembles in 
convenient form data which Lehman 
and his collaborators have been col- 
lating for more than two decades. 
Nearly all of Lehman's previous con- 
tributions apparently have been 
incorporated here, although the vol- 
ume makes no explicit statement on 
this point and provides no complete 
bibliography of Lehman's publica- 
tions. The book consists of earlier 
research reports partially rewritten 
for this presentation and supple- 
mented by some new data. 

The majority of Lehman's re- 
searches have been directed to the 
production of creative works, such as 
paintings, poems, musical composi- 
tions, inventions, and scientific dis- 
coveries, as a function of age. His 
general finding is that while master- 
pieces and “best works" may be 
produced at any time during the ma- 
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ture years, in many fields the peak 
of productivity is reached in the 
thirties and forties, if not before 30, 
The number of contributions per 
decade per surviving contributor is 
greatly reduced in the later years. 
Thus, the period between ages 50 and 
70 has been responsible for a very 
small part of the world's best work, 
except in certain fields such as biog- 
raphy and history. During the fifties 
the production of "best works" per 
living creative worker in many spe- 
cialties is only 20 per cent or less of 
what it was in the thirties, and later 
productivity is even less. If weextra- 
polate from Lehman's data, it ap- 
pears that the imminent increase in 
the average life span from age 70 to 
age 80 or beyond will result in an in- 
consequential increase in creative 
contributions of first-rate impor- 
tance. i 

These findings, if taken at their 
face value, are of great importance, 
and careful attention should be given 
to their interpretation. Most of the 
remainder of this review therefore will 
be devoted to an examination of this 
question. No lack of confidence in the 
accuracy of Lehman's data is impli¢ . 
His data seem to us to be impressive 
in their objectivity, their quantity, 
and their self-consistency. We are 
concerned solely with interpretation 
Lehman himself is most cautious in 
this regard; it is hoped that others 
will be equally restrained in regard to 
their generalizations. 

First of all, it is necessary to keep 
in mind that nearly all of Lehman $ 
studies of creative contributions are 
concerned not with the aggregate 0 
creative products but only with "best 
works" and “masterpieces.” This 
fact introduces into Lehman's studies 
inherent limitations which should be 
recognized. By definition no man ca? 
produce more than one ''best work. 
If this product appears early !n the 
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life span, any number of later contri- 
butions of nearly equal merit will not 
affect analyses such as Lehman'sre- 
gardless of longevity. The compara- 
tive frequency of masterpieces at dif- 
ferent ages therefore probably pro- 
vides a very inaccurate index of the 
relative value of work done in differ- 
ent decades. Lehman's studies may 
show that man's "best" work is fre- 
quently done fairly early in life, but 
the relative value of other work is 
unassessed. 

Lehman's finding that master- 
pieces in a given field are often pro- 
duced in the thirties may properly be 
likened to a report by an anatomist 
or physical anthropologist that the 
maximum height of man is most often 
reached during this decade. Such a 
report, while interesting, would give 
us no measure of the degree of de- 
crease in height which occurs in the 
later decades. It is true that Leh- 
man's data show that older men pro- 
duce fewer works reported to be of 
high value than do men in their thir- 
tes. But whether the value of the 
Septuagenarian as a creative worker 
relative to the 30-year-old is 99 or 50 
9r 10 per cent cannot be determined 
by studying masterpieces. 

Although Lehman's primary con- 
cern is with the relative frequency of 
the notable contributions at different 
ages, in a few instances he presents 

ata on the total contributions, re- 
gardless of value, made in different 
«cades of life, In contrast to the 
est” contributions, the total num- 
NA creative works show little de- 
Rr age. This difference be- 
tivit t 4 age trend for total produc- 
Sale and the age trend for highest 
that di productivity is clearly one 
eserves additional analysis in 

€ future, 
bag erecti “best works” for study, 
trod n has carefully avoided the in- 
Uction of any bias of his own into 
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his data. Lists of the outstanding 
achievements in any field have, in 
each case, been made by others, not 
by Lehman. But this procedure, 
while commendable, does not avoid 
biases which may have been present 
in the evaluations of anthologists 
and historians, whose decisions are 
utilized by Lehman. Thus, if antholo- 
gists tend to choose the earlier of two 
works of almost equal merit produced 
by the same creative worker, this 
tendency would obviously produce an 
overestimation of the productivity of 
masterpieces in the early decades of 
life. We have no evidence concerning 
the presence or absence of a bias 
favoring selection of early works on 
the part of editors, compilers, biog- 
raphers, etc. 

Some of the anthologies and histo- 
ries used by Lehman as sources of data 
were published shortly after the death 
of many of the men whose works are 
evaluated therein. There exists the 
possibility that historians and editors 
may hesitate to include the later works 
of a recently deceased man because of 
the difficulty of judging recent con- 
tributions. It is our impression that 
hesitancy in making judgments con- 
cerning the recent past is common 
among historians. If this is true, it 
would lead to low scores in later ma- 
turity for the men who died only a 
few decades prior to the publication 
of a source book or history. 

In some analyses Lehman treats 
separately men of former centuries 
and men who lived recently. Men of 
former centuries show less decrement 
with age than do recent men. This 
fact is susceptible to the interpreta- 
tion that there is a bias against the 
citation of a man's later contribu- 
tions, on the part of authors writing 
soon after his death. 

Further influences to be considered 
in interpreting Lehman's data are the 
possible changes in standards of com- 
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parison that may take place in the 
course of a life span. Each successive 
quarter of a century within the past 
200 years has witnessed a great in- 
crease in the ranks of creative work- 
ers and in the number of entries in 
lists of notable contributions. This 
means that a man in recent centuries 
has had many more competitors for 
honors in his field at age 75 than he 
had at age 25. For example, in our 
own field of psychology, a man pub- 
lishing today who wishes his work to 
be noted is potentially competing 
with a large proportion of more than 
11,000 other psychologists. Had he 
been writing 50 years ago, he would 
have been competing with a total 
population of scarcely more than 100. 
A comparable change has occurred in 
most other fields. As suggested 
above, this great increase in number 
of workers introduces the possibility 
that the standards for “‘best works" 
in various fields have been progres- 
sively raised. Lehman’s data may 
reflect not age changes but, instead, 
changes in the requirements for recog- 
nition that have occurred in the life 
span. Analysis of data for an era not 
characterized by an increase in the 
number of creative workers might 
give quite different “age” trends. 
Evidence on this point is lacking and, 
indeed, may be impossible to obtain. 
But the absence of such evidence 
should make us wary of interpreting 
Lehman's curves as representations 
of changes in the individual during 
the age continuum. 

We have noted previously that 
Lehman presents data showing that 
Scientists, inventors, authors, and 
other creative men born a century or 
more ago yield age curves, in regard 
to "important" works, somewhat 
different from the age curves pro- 
duced by men born more recently. In 
general, the men of earlier centuries 
maintained a better record in later 
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maturity than have men born more 
recently. Several interpretations are 
possible. One was suggested earlier 
in this review. Another is that stand- 
ards of comparison in former times 
changed less drastically during the 
course of a man's lifetime than they 
do today. > 

Lehman has done a major service 
to psychology by assembling a multi- 
tude of facts relative to age and 
achievement. There are limitations 
inherent in his data, and we have 
indicated factors other than age 
which may be responsible in part, or 
in toto, for his “age” curves. Never- 
theless, he has supplied the data 
which are fundamental to further 
studies. As is usual with those who 
pioneer in new fields, Lehman has 
posed more problems than he has 
solved. Lehman’s own discussions, 
as well as the fine foreword by Ter- 
man, provide an excellent précis of 
the further research problems in re- 
gard to age and achievement which 
await solution. 

Wayne DENNIS 
Brooklyn College 


Gray, J. STANLEY. Psychology in in- 
dustry. New York: McGraw-Hill, 
1952. Pp. vii+401. $5.00. 


There has been a tendency in the 
past few years to rush the publication 
of textbooks in industrial psychology: 
This is true of texts in readings !n 
industrial psychology as well as 9 
those dealing with human and indus- 
trial relations, As a result, the selec- 
tion of topics and the thoroughness 
with which they have been written 
are somewhat haphazard. Gray 
text is no exception. à 

The book is written in a strictly 
logical order. In Chapter 1, the au- 
thor introduces the reader to some 
basic concepts in industrial psychol- 
ogy and then proceeds to consider the 
job and its basic requirements. Pe 
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considers next the usual areas of 
study in industrial psychology, ies 
training, motivation, morale, acci- 
dents, etc. A chapter on the age of 
workers is also included. Orthodox 
presentations of these topics are 
made. Little attention is given to 
newer applications of social psychol- 
ogy to industrial personnel problems. 

Chapters on methods of work, 
wages and job evaluation, nutrition 
and rest, and lighting and ventilation 
are also included. It appears that 
these areas should be left to the indus- 
trial engineer and more time spent on 
the basic psychological problems of 
industry. A text is needed which will 
start out by stating the basic objec- 
tives and functions of the industrial 
psychologist. Such an approach 
would help the beginning student and 
the professor who has to teach the 
course. Moreover, emphasis needs to 
be placed on methods and procedures 
through which industry might bene- 
fit in worker efficiency. 

Gray has written a textbook which 
provides the basic fundamentals and 
concepts of the field of industrial psy- 
chology. In this respect the book is a 
good compendium of facts. The text, 
however, falls short in the presenta- 
tion of the real problems of the indus- 
trial psychologist. 

D. J. Morri& 

North Carolina State College 


DANIEL, RonEnTS., & Louttit, C. M. ` 


Professional problems in psychology. 
New York: Prentice-Hall, 1953. 
Pp. xi+416. $5.50. 


. "Professional" is used broadly to 
include the problems of the teacher, 
E worker, and scholar in psy- 
chology as well as the problems of the 
Morsologiat who earns his living in 
f nical, industrial, or some other 
orm of psychological practice. For 

€ members of that broad audience 

aniel and Louttit have collected a 
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surprisingly varied amount of infor- 
mation, much of it of course from other 
sources, but in good part original. 

Do you want to know the names 
and publication dates of bibliographi- 
cal sources to psychological or general 
literature, the address of a publisher, 
where you can get a fractional horse- 
power motor, an electronic regulator, 
or a test? Are you interested in a 
definitive list of the 331 psychological 
journals that have been published at 
one time or another, in an authorita- 
tive guide for abbreviating journal ti- 
tles, or in information on the prepara- 
tion of various kinds of manuscripts? 

Do you want to know who attended 
the first APA meeting, how many 
members the Association had in 1943, 
where and when the first Interna- 
tional Congress of Psychology was 
held, or the names of the four psycho- 
logical journals that were published 
before 1800? Or do you wanta list of 
state psychological associations, in- 
formation about ABEPP examina- 
tions, or a discussion of public rela- 
tions in psychology? 

This sampling gives an idea of the 
wide scope of information contained 
in the four sections and four ap- 
pendices which make up the book. 
The first section is introductory. 
Then follow series of chapters on 
psychological literature, reporting 
psychological research, and problems 
of the professional psychologist. The 
appendices are an annotated list of 
reference books of value in psychol- 
ogy; à bibliography of journals in 
psychology; a list of sources for 
books, tests, apparatus, equipment, 
and supplies; and a glossary of ab- 
breviations useful to the psychologist. 

The inclusion of sections on psy- 
chological literature and the reporting 
of psychological research may come 
as a surprise to some readers who 
interpret the title as dealing only 
with licensing, ethics, interprofes- 
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sional relations, and other such “‘pro- 
fessional" topics. But the inclusion is 
a valuable one, and the authors are 
well qualified in these areas. Louttit, 
in particular, has long been one of 
psychology's experts on psychological 
literature. Both of these sections are 
necessarily condensed, for they cover 
alotofground. But both are authori- 
tative. The final section, on problems 
of the professional psychologist, is 
also brief; it too covers a lot of 
ground, but the nature of the material 
does not permit an authoritative 
treatment. Psychologists have not 
yet reached full agreement on many 
of the problems discussed in this sec- 
tion, and the very nature of the prob- 
lems makes it impossible for the au- 
thors to be as definitive as they could 
be in the earlier sections. The major 
problems are introduced ; some of the 
principal points of view are outlined; 
and references to fuller discussions 
are plentiful. 

For the graduate student the book 
consists of a thoughtful compendium 
of advice about collecting a working 
bibliography of source materials, 
about the publication policies and the 
kinds of articles accepted by each of 
a number of journals, about the prepa- 
ration of a manuscript and how to see 
it through the press, about ethics, 
interprofessional relations, psycho- 
logical organizations, the teaching of 
psychology, and other problems that 
he will encounter as a professional 
psychologist. For the mature psy- 
chologist the book's chief usefulness 
will be as a reference volume. He 
already knows some of the things in- 
cluded, so can browse through it to 
get acquainted with its coverage. 
Then it belongs on the handiest shelf, 
for it will be a frequently consulted 
reference source. 


DAEL WorrLE 


Commission on Human Resources 
and Advanced T) raining 
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Canor, Hucn, & Kant, JOSEPH A. 
Human relations. Cambridge, 
Mass.:Harvard Univer. Press, 1953. 
Vol. I. Concepts. Pp. xxxi+333. 
$4.75. Vol. II. Cases. Pp. viii 
+273. $4.25. 


These two volumes by Cabot and 
Kahl constitute the fruits of their 
efforts to study and teach human rela- 
tions at Harvard University during 
the last decade. Simply stated, their 
approach involves two fundamental 
views: (a) that effective face-to-face 
human relationships can only emerge 
from a knowledge derived from the 
application of fruitful social science 
concepts to concrete social situations. 
Ostensibly, a full „understanding of 
these relationships requires not only 
meaningful conceptual tools but also 
the skills necessary for their utiliza- 
tion in specific social settings. And 
(b) that the communication or learn- 
ing of the concepts and the skills 
necessary for their application can 
best be achieved by the medium of 
group discussions of case material 
representing concrete social situa- 
tions. 

Volume I is devoted almost exclu- 
sively to the interpretation and elab- 
oration of selected social concepts, 
whereas Volume II consists of 33 
cases of social interaction in face-to- 
face situations. The presentation in 
Volume I takes the form of a series of 
discussions by the authors in which 
the theoretical views of various prom- 
inent social theorists and practition- 
ers are summarized and, to some €x- 
tent, integrated. At the end of each 
discussion, i.e., chapter, many of the 
writings from which these views were 
taken are presented as a series of read- 
ings. In some cases complete articles 
are reproduced and in others perti- 
nent selections from books are given. 
Approximately two-thirds of the first 
volume consists of these readings. 
Because the thinking of so many 
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theorists is involved, and inasmuch 
as many of these views have been ex- 
tensively reported on and evaluated 
in the literature, critical evaluation 
of specific theoretical approaches will 
not be attempted. As a source book 
of human relation concepts intended 
for the student and nonacademic 
reader, the presentation of these con- 
cepts in terms of clarity and level of 
discussion becomes the critical ques- 
tion. 

It is clearly evident that an at- 
tempt to understand human interac- 
tion in specific face-to-face situations 
implies an idiographic or "clinical" 
approach. The first two chapters of 
Volume I are devoted to a discussion 
of the nature and difficulties involved 
in such an approach. Here the au- 
thors stress the importance of theo- 
retical generalizations in understand- 
ing the unique event as well as a con- 
sideration of all the factors in the 
situation and of the relationships 
among these factors. They caution 
against distortions in social observa- 
tions introduced by “habits of 
thought and language." In effect, the 

iscussion summarizes the views of 
A. J. Lowell, L. J. Henderson, C. H. 
Cooley, S. I. Hayakawa, and others. 

Chapters III to VII deal with 
concepts relevant to understanding 
the behavior of the individual in rela- 
tion to others. Most of the theoretical 
conceptions considered have been 
cited extensively in psychological and 
Sociological literature and thus need 
only be cited in passing. Chapter 

II, entitled “Cultural Values and 
E Roles,” draws heavily upon 
bii of Durkheim (**Anomie"), 
hd Kluckhohn ("Dominant and 
ü int Cultural Value Orienta- 
ons ) Linton (“Status and Role"), 
benedict, (“Continuities and Discon- 
ulties”), Parsons (“Age and Sex in 
ial Structure”), and Warner and 
unt (“Social Class and Social Struc- 
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ture"). A discussion of the conflict 
between the need to belong and the 
individual's desire for independence 
in Chapter IV briefly summarizes the 
thinking of Erich Fromm. A number 
of the writings of Piaget constitute 
the framework for a discussion of 
“Our Developing Social Interactions” 
in Chapter V. Chapter VI considers 
the nature of conscious and uncon- 
scious thought processes (Freud, 
Munford, etc.); and finally, Chapter 
VII is devoted to a discussion of the 
effects of anxiety and fear on social 
interaction (Mayo, Horney, etc.). 

The remaining six chapters are 
focused more or less on the problems 
of group structure and function. 
Chapters VIII and IX deal with 
“Group Membership" and “Group 
Processes” respectively (Cantril, W. 
F. Whyte, Davis, Homans, etc.). 
Chapter X deals with the problem of 
differences in group values reflected 
in the conflicts between racial and re- 
ligious groups in the United States. 
The writings of Rose, Myrdal, Lewin, 
Frenkel-Brunswik, and others form 
the basis of this discussion. The prob- 
lem of modifying values and feelings is 
discussed in Chapter XI. Here the 
importance of a nondirective ap- 
proach in a face-to-face situation is 
stressed as the primary condition for 
bringing about change. As one might 
expect, the writings of Carl Rogers 
form the basis of much of this treat- 
ment. The final two chapters, XII 
and XIII, deal with the problems of 
"Executive Leadership” and ‘‘Social 
Control and Equilibrium." 

It is readily patent that the con- 
cepts discussed in Volume I—as the 
authors themselves point out in the 
Introduction—are "interrelated con- 
cepts from several... disciplines." 
In various places the interrelatedness 
of some of the concepts is stressed, 
but, as the authors indicate, no at- 
tempt is made to put forward a 
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"tightly reasoned system of social 
theory." The order in which the con- 
cepts are presented rests on the as- 
sumption that effective action in a 
social situation must be preceded by 
skillful diagnosis. Thus, the discus- 
sions in the earlier chapters orient the 
individual toward the problem of 
social observation, while the later 
ones are more “‘action’’ oriented. 
Cabot and Kahl are by no means 
dogmatic about the order of concept 
presentation; they suggest that the 
order can and should be varied to 
meet the needs of particular groups. 

In the main, the selection of con- 
cepts and the order in which they 
are presented reflect very careful 
thought. In reading through Volume 
I, one cannot but be impressed by the 
authors' cogency in organizing social 
data. Yet, one rather serious criti- 
cism of the volume should be made. 
We have no quarrel with the particu- 
lar social concepts selected, but rather 
with the conspicuous absence of any 
systematic discussion of the biological 
contributions to personality develop- 
ment. True, the purpose of the writers 
was to provide the reader with a 
social theoretical framework for un- 
derstanding social interactions in con- 
crete situations. But they have 
ignored their responsibility of not 
leaving the student and nonacademic 
reader with the distorted view that 
human reactions are purely a function 
of sociocultural factors. Some pre- 
liminary discussion of the contribu- 
tions of essential structure and 
equipment in producing differences 
in intelligence, drive intensity, and 
the like would have gone a long way 
toward reducing the possibilities of 
such distortion. While it is critical 
for the reader to understand the im- 
portance of learning in the develop- 
ment of social interaction, it is no less 
important for him to understand the 
limitations imposed on such learning 
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in an individual with an IQ of 70. 
Perhaps the most disturbing aspect 
of Volume I is the level of presenta- 
tion. Both volumes are clearly in- 
tended for undergraduates in inte- 
grated social science courses and non- 
academic groups of various kinds. 
Somewhere along the line the authors 
lost sight of some, if not all, of their 
intended readers—insofar as Volume 
I is concerned. In general, their 
discussions tend to be abstruse and, 
in the reviewer’s opinion, will tax the 
comprehension of the average under- 
graduate and certainly individuals in 
many nonacademic groups. In part 
this is because, although the writings 
of many of the theorists demanded 
clarification and a simpler presenta- 
tion, Cabot and Kahl simply sum- 
marized them in much the same 
language. While various Steig car- 
toons presented throughout Volume 
I serve to suggest a concern with the 
communication of ideas, the actual 
discussions stand in sharp contrast to 
these ingenious drawings. A greater 
concern for the general reader by 
avoiding esoteric discussions would 
have gone a long way toward im- 
proving the value of the volume. — 
On the other hand, the selection 
and presentation of the case materia 
in Volume II are outstanding. The 
cases depict actual social interactions 
encompassing a wide variety of situ- 
ations. Inasmuch as they are to 
serve as the “raw data” that are to 
be organized in terms of the concepts 
presented in Volume I, they are 
straightforward factual accounts. 
They are presented in detail, with 
both background material and ver 
batim conversations and, more $i£- 
nificantly, without any attempt to 
illustrate particular theoretical points 
of view. Because their approach to 
increasing the individual's under- 
standing of human relations rests 0 
his learning through group participa- 
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tion, Cabot and Kahl selected those 
cases which they consistently found 
produced good group discussions. In 
effect, the cases have been “pre- 
tested." 

While the two volumes were de- 
signed as a unit, it is clearly evident 
that their use is not restricted in this 
respect. The cases can be employed 
with other concepts (or readings) 
than those presented in Volume I; 
and, of course, the concepts in Vol- 
ume I can be used in any order or 
with a different set of cases. As the 
writers indicate, "the chapters and 
cases are so written that they are to 
a considerable extent complete in 
themselves, and such modifications 
can be readily made.” 

The number of colleges and uni- 
versities in which integrated social 
science courses are being introduced 
into the undergraduate curriculum is 
on the increase. There is a desperate 
need for especially prepared written 
material for such a course for first- 
year college students. Both objec- 
tively reported case materials and 
succinct theoretical discussions for 
the undergraduate, lacking a back- 
ground of specialized courses, are 
required. With respect to the former, 
Cabot and Kahl have made a definite 
contribution. Unfortunately, the 
same cannot be said with reference to 
the latter. While some portions of 
their conceptual discussions in Vol- 
ume I probably can be used for inte- 
grated social science courses, on the 
Whole much of it falls far short of the 
Mark, 

HAROLD PROSHANSKY 

Brooklyn College 


Watine, Jonn W. M., & CHILD, 
Irvin L. Child training and per- 
sonality: a cross-cultural study. 
New Haven: Yale Univer. Press, 
1953. Pp. vi+353. $5.00. 


This is the report, within a specif- 
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ically defined theoretical framework, 
of a careful study in which 75 primi- 
tive societies are used as the individ- 
ual variables. The sources of infor- 
mation about these societies are pub- 
lished anthropological reports, and 
are given in a classified list of refer- 
ences. The aim of the study is to 
test certain hypotheses about the 
effect of child-rearing practices on 
producing characteristic personality 
structures of the members of a cul- 
ture. The authors have set up a series 
of hypotheses, and then tested them 
by applying to them available de- 
scriptions of the cultures concerned. 
These hypotheses are derived from a 
general behavior theory in which cer- 
tain aspects of Freudian psycho- 
analytic theory are adapted to Hull- 
ian behavior theory. 

Five systems of behavior are se- 
lected for study because they are con- 
sidered basic to personality develop- 
ment, are subject to training and con- 
trol in all cultures, and are treated in 
a wide variety of ways in the differ- 
ent cultures. These systems of be- 
havior are: oral, anal, sexual, de- 
pendence, and aggression. For each 
culture the data about each of the 
systems were categorized according 
to (a) initial indulgence, (b) age of 
socialization, and (c) severity of 
socialization. Within these categories 
the classes of behaviors were defined 
in such a way as to maximize the ob- 
jectivity of classifications, and to 
facilitate making useful distinctions. 
Three judges, working independ- 
ently, rated and ranked the data. 

The Freudian concept of fixation is 
considered basic, but it has been 
modified by division into positive 
and negative fixation. Evidence for 
both kinds of fixation is studied in 
each of the five systems. Positive 
fixation is expected to result from 
satisfactions experienced when drives 
in any behavior system are indulged. 
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Negative fixation should occur in a 
behavior system when training inter- 
feres with natural behavior tenden- 
cies. Several other concepts, such as 
"habit potential," ‘custom poten- 
tial" and "custom complex," were 
defined as important in dealing with 
the cultural variables. 

The personality characteristics 
studied are based on customs for 
which there was generally good infor- 
mation and for which, according to 
the authors' theory, there was ex- 
pectation of a relation to the child- 
training practices studied. These 
characteristics are the attitudes in 
a society toward illness, its causes 
and cures, and toward death. Cus- 
toms relating to illness were divided 
into: explanations of illness, perform- 
ance therapies, avoidance therapies, 
and fear of others. Each of these was 
in turn subdivided according to the 
five basic systems of behavior. The 
data on illness from each culture 
were then classified by two judges 
who were kept in ignorance about 
the child-training classifications. 

Interrelationships were studied 
among the various aspects of child- 
rearing practices, and between these 
and the personality characteristics of 
the adult members of the cultures, as 
evidenced in their customs in rela- 
tion to illness. In many cases the 
theoretical hypotheses were con- 
firmed or supported; in others the 
expected relationships were not found. 
The failure of verification of some 
of the hypotheses could be due to in- 
adequacy of the data, which are 
often lacking in information pertin- 
ent to this inquiry. The findings are 
interesting and present a very chal- 
lenging array of problems for future 
testing, by additional cultural stud- 
ies and by experiment. 

It is difficult to summarize the 
findings from the study, as they are 
so bound up with the context of in- 
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terrelating variables and theories, 
But some examples may be given, 
The authors feel justified in their 
separation of positive and negative 
fixation. Negative fixation was most 
clearly demonstrated: it was shown 
in "the effects of anxiety developed 
during the period of socialization for 
each system of behavior." Positive 
fixation seemed to be related to a 
later period of childhood and to anal 
and sexual satisfactions. ''Viewed as 
a whole, the evidence for a lasting 
effect of child training practices on 
personality is more striking and con- 
sistent for the three... systems— 
oral, dependence, and aggression.” 
There was some evidence that guilt 
arises in the process of socialization 
as a consequence of identification 
with the punitive parent. In a very 
ingenious treatment of the data re- 
lating to the origins of unrealistic 
fear of other persons (paranoia) the 
authors “found that the fear of others 
is primarily associated with anxiety 
about aggression." Although the cor- 
relational method does not permit 
determination of the direction of the 
cause-effect relation, the authors feel 
they are justified in concluding that 
child-training practices do at least in 
part influence adult personality. . 
An incidental finding of interest 15 
a comparison of child-training prac- 
tices in midwestern urban middle- 
class families and the primitive cul- 
tures. By comparison, the Americans 
were exceptionally early and severe 
in toilet training (anal), and quite 
severe in weaning (oral) and in gen- 
eral socialization (average of five 
systems). Only in the socialization of 
aggression are they close to the norm 
for primitive cultures. However, the 
evaluation of these tendencies should 
be made in relation to their place 1n 
the total cultural pattern, and the 
authors explicitly make no recom- 
mendations regarding them. 


_ This book has much to contribute, 
methodologically and theoretically, 
to the systematic treatment of the 
development of personality. But it is 
wery difficult to read. The findings 
ure buried and easily lost in the dis- 
cussion and the tedious (though of- 
—ten important) qualifications of theo- 
retical presentations. Perhaps this 
difficulty is unavoidable in working 
out this kind of complex general be- 
“havior theory. But it seems to this 
"reviewer that a simpler style could 
make the theory a little more readily 
‘available and generally understood. 

Nancy BAYLEY 

University of California, Berkeley 


Rosen, Joun N. Direct analysis: 

Selected papers. New York: Grune 

Em 1953. Pp. vii+184. 
SAY 


The regard, respect, and human 
Warmth for the mentally sick individ- 
wal that John Rosen conveys in his 
public appearances are well reflected 
in the nine papers which have been 
- collated into this slim volume. Eight 
i of these papers have seen prior pub- 
lication or presentation before psy- 
chiatric audiences, but the first 
Paper, "Direct Analysis: General 
Principles," was specifically prepared 
for this book. 

There are two basic premises from 
Which Rosen proceeds: first, that 
unless physical pathology can be 
clearly demonstrated, psychoses are 
PSychogenic in origin and are treat- 
able by psychologic means; and sec- 
ond, that the nuclear core of the psy- 
chotic tesponse is formed in the oral 
E of infant dependence upon the 
E ot er. The effectiveness of direct 

nalysis lies in the capacity of the 
a E to enter without fear di- 
E. Y into the patient's psychosis and 
3 „clarify for him the dynamics of 

IS oral deprivation by direct psy- 
“Aoanalytic interpretations as well as 
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by providing some of the absent 
love. Once reality contact has been 
firmly established, the therapy pro- 
ceeds along more traditional psycho- 
analytic lines. On the basis of his ex- 
perience, the author categorically 
asserts that he knows of no psycho- 
genic psychosis that cannot be signif- 
icantly improved by psychological 
therapy (i.e., psychoanalysis for Ro- 
sen). The author seems to have need 
for a rather uncritical, uncluttered, 
orthodox Freudian psychoanalytic 
theorizing to lend “scientific respect- 
ability" to his superb skill in inter- 
personal relationship therapy and to 
enable him to communicate to others 
what he believes to occur in the 
therapy situation. As an experi- 
mental method for studying the psy- 
chotic process, Rosen's techniques 
may prove extremely valuable. The 
limitations of application of this 
procedure to the massive problem of 
the psychotically ill are self-evident. 

The interview and case material 
provide a fair amount of source ma- 
terial on the practice of direct analy- 
sis. The book is well worth reading 
not only for its contributions to the 
psychopathology of psychosis but 
also for its quality as a personal 
document. 

M. ERIK WRIGHT 
University of Kansas 


VirELEs, Morris S. Motivation and 
morale in industry. New York: 
Norton, 1953. Pp. xvi+510. $9.50. 


Viteles has undertaken, in this 
volume, to synthesize the data and 
reconcile differences from the ever in- 
creasing investigations of the “will- 
to-work” in industry. He has done 
an acceptable job in terms of the 
limitations of the studies available 
and the diverse orientation toward 
the problems of motivation in indus- 
try that the investigators on whose 
work he relies have. In doing this he 
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has produced a much needed review 
of the literature in this area for re- 
search workers and a handbook for 
personnel-minded members of man- 
agement. 

The book is directed primarily to 
members of management who are 
responsible for the development and 
administration of personnel policies. 
The emphasis throughout is on ap- 
plications of research findings to 
problems of motivating employees. 
Viteles has developed a reasonably 
consistent set of principles for this 
purpose and has presented specific 
suggestions for their application. He 
also has pointed out inadequacies in 
available data and emphasized the 
need for further research. 

In addition to affirming the im- 
portance of security, participation, 
status, and role in employee motiva- 
‘tion, the author has re-examined the 
problem of economic motivation and 
has rescued it from the limbo to 
which it has been consigned by the 
Hawthorne investigation and certain 
attitude surveys. He has phrased 
properly the problem of financial in- 
centives versus human relations in 
the motivation of workers: employees 
want both, and the problem con- 
fronting management is an integra- 
tion of the two. 

Viteles relies for his data on in- 
vestigations classified roughly as ‘‘ex- 
perimental” studies and attitude 
surveys. Typical of the former is the 
Hawthorne investigation. Data from 
attitude surveys represent a wide 
range of approaches, from those of 
Houser in the late twenties to the 
General Motors MJ Contest and 
work of the Michigan Survey Re- 
search Center and the National In- 
dustrial Conference Board. Most of 
the studies have appeared within the 
last five years. The investigations, 
for the most part, were conducted in 
industrial situations. 
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In spite of the excellent synthesis 
of these investigations, this reviewer 
is concerned with Viteles’ failure to 
evaluate critically certain investiga- 
tions on which he relies for his con- 
clusions. It is true that he has an ex- 
cellent chapter on the weakness of 
attitude survey methodology. Yet re- 
sults of studies by the Opinion Re- 
search Corporation, Rose’s study of 
workers’ attitudes toward labor un- 
ions, and others are presented with- 
out making specific their limitations. 
Viteles cannot assume that the audi- 
ence for which this volume is primar- 
ily intended, managerial personnel, 
will be well enough acquainted with 
these investigations or sufficiently 
sophisticated to evaluate their re- 
sults without a specific statement 
from the author as to their limita- 
tions. In contrast, Viteles has been 
quite explicit as to the research weak- 
ness of the investigations dealing 
with group decision making in moti- 
vating employee behavior. y 

As indicated earlier, this volume 1s 
intended primarily for administrators 
concerned with personnel policy. 
This reviewer is somewhat appre 
hensive that the administrators who 
read this book will receive the idea 
that developing the will-to-work is 
primarily a matter of techniques 
rather than dependent on a funda- 
mental philosophy of the dynamic 
relationship between a man and his 
job. There is a wide discrepancy be- 
tween regarding “security” asa means 
of increasing worker productivity 
and regarding "security" as a Way 9 
living essential to the well-being © 
both the individual worker and his 
employer. The latter concept may, e 
implicit in Viteles’ treatment of in 
centives but it needs to be made ex 
plicit. í 

The author usually keeps in view 
the audience for which he is writing. 
The style is reasonably straightfor- 
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ward although the wealth of material 
cited at times results in the impres- 
sion that the volume is an annotated 
bibliography. Statistical terminology 
at times will confuse the lay reader 
especially since no explanations of the 
concepts are offered. 

Motivation and Morale in Indusiry 
has real significance for industrial 
psychology. It is the only compre- 
hensive and compact source of biblio- 
graphical data in this area available 
to psychologists working on indus- 
trial problems. Even more important, 
it emphasizes the centrality of the 
problem of motivation in industrial 
psychology. It is significant that 
Viteles has chosen to write on this 
subject rather than to revise his In- 
dustrial Psychology published in 1932. 
It is an indication that psychologists 
concerned with industry are now 
moving away from a primary con- 
cern with the peripheral problems 
of employee selection, training, vi- 
sion, and job analysis to the central 
problem of the work situation—the 
dynamics of behavior within the in- 
dustrial organization. 

s WILLIAM MCGEHEE 

Fieldcrest Mills, Inc. 


WALTER, W. Grey. The living brain. 
Ber York: Norton, 1953. Pp. 311. 


Starting with a delightfully writ- 
ten, simple story of the evolution of 
the nervous system and the emer- 
gence of the brain of man, Walter 
ends with some thoughts on the brain 
of tomorrow. He pays high tribute 
to Pavlov and Berger, whose con- 
he to conditioned reflexology 
a electroencephalography, respect- 
dm i are landmarks in our under- 
C» ing of the brain. He attempts a 
i pince of the notions resident 
brillian: seemingly diverse areas. A 
d iant thinker and writer, Walter 

oes not hesitate to highlight his 
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own role in the tremendous progress 
in electrophysiology in the last dec- 
ade or so. For one looking on de novo, 
the impression might be gained that 
electroencephalography has been 
brought to its present state of devel- 
opment by the author singlehandedly. 

The EEG and its elaborations by 
automatic analyzer and toposcope 
are looked upon as a suitable mirror 
for the brain’s physiological activi- 
ties, especially for detecting the sig- 
nificance of patterns which may cor- 
relate with the psychological phe- 
nomena of perception, learning, and 
personality. Certainly some interest- 
ing ideas are presented. The reviewer 
shares with Grey Walter the hopeful- 
ness of rapprochement in the realms 
of the physiological and psycholog- 
ical, but at best it will be a slow and 
laborious process. This is a book of 
ideas and hunches, flavored perhaps 
more with wish fulfillment than with 
empirical fact, but nevertheless an 
exciting account which psychologists 
might well read with profit. 

Walter is greatly impressed with 
the subjective effects of flicker stim- 
ulation, and like others sees in photic 
driving a hopeful device for manipu- 
lating brain rhythms in frequency, 
phase, and space distribution with 
the goal of testing the hypothesis that 
brain rhythms serve a scanning func- 
tion similar to that in television. 
Two chapters, six and seven, en- 
titled "Learning about Learning” 
and "Steps from Chance to Mean- 
ing," ought to prove of interest and 
value to the psychologist, for here 
elements of neurophysiology, com- 
munication theory, feedback, and 
scanning mechanisms are brought 
to bear on the perceptive-learning- 
memory process. Much of Walter's 
thinking on such matters will seem 
somewhat more cavalier than com- 
prehensive, and stems from analog- 
ical conceptions developed through 
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the construction of ingenious electro- 
mechanical models. The most elab- 
orate of these is Machina Speculatrix, 
a bug-like toy, which has built into 
it potentialities for responsive be- 
havior described as exploratory cur- 
iosity, sensory discrimination, nega- 
tive and positive tropism, and other 
properties called discernment and 
self-recognition. With such a device 
he has identified what he believes are 
seven steps from chance to meaning. 
'The circuit diagrams for a nerve 
model, M. speculatrix, and a condi- 
tioned reflex analogue are presented 
in appendices. All in all it is an inter- 
esting and provocative book. 
DoNarp B. LINDSLEY 

University of California at Los 

Angeles 


FEsTINGER, LEON, & Katz, DANIEL. 
(Eds.) Research methods in the behav- 
ioral sciences. New York: Dryden 
Press, 1953. Pp. xi+660. $5.90. 


Because of the large number and 
variety of variables that may affect the 
phenomena they study, psychologists 
have been forced to pay particular at- 
tention to problems of research design 
and techniques of observation and 
measurement. Research Methods in 
the Behavioral Sciences is devoted to 
the consideration of these problems 
as they are faced by the investigator 
who is concerned with human behav- 
ior in its social setting. 

The general attitude of the authors 
influences the organization and con- 
tent of the text in at least two differ- 
ent ways. In the first place the meth- 
ods discussed are conceived to apply 
widely to studies of social behavior 
regardless of the discipline—anthro- 
pology, psychology, sociology—to 
which an investigator formally claims 
allegiance. The emphasis is upon the 
type of question asked rather than 
upon the arbitrary division of the 
social sciences. It is for this reason 
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that the text has in its title the term 
“behavioral sciences." In the second 
place, the authors feel strongly that 
progress in the investigation of man's 
social behavior is most likely to be 
achieved by use of the basic logic of 
scientific methodology. This is the 
logic upon which research design in 
all sciences depends, hence, “... 
there are probably no social-psycho- 
logical methods as such." 

I would give an incorrect impres- 
sion to imply that this text is primar- 
ily a discourse on the logic of scien- 
tific method illustrated with exam- 
ples from research in the behavioral 
sciences. A distinction made by the 
authors between methods of research 
design and techniques by which a 
design may be put into effect is 
analogous to the military distinction 
between "strategy" and "tactics." 
Accepting that scientific logic is com- 
mon to all sciences, the editors em- 
phasize that“ . . . specific techniques 
and approaches will vary, depending 
upon the subject matter . . ." under 
investigation. It is in their tactics 
that the behavioral sciences differ 
from other sciences. E 

The most important contribution 
of this text lies in the systematic man- 
ner in which the techniques available 
to the behavioral sciences are pre 
sented. The text follows the sequence 
of operations confronting the re- 
search worker from the time the ques- 
tion he is asking has been clearly 
stated to the time when he is ready 
to present a report on his efforts to 
answer it. A major portion of the 
book is devoted to procedures for 
sampling a population to be studied, 
and to techniques of collecting an 
analyzing data. In many cases tec? 
niques are described in the detail 
which is usually associated with 2 
handbook. In this respect, the text 1s 
most useful to those who wish to 
check their own procedures as well as 
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those who are in the process of devel- 
oping new research skills. The latter 
should understand that just because 
the text is so systematic in its organ- 
ization does not mean that all meth- 
odological problems in social psy- 
chology have been solved. 

]t is understandable that, in at- 
tempting to be systematic, the au- 
thors have not always succeeded in in- 
troducing originality into their sub- 
ject matter. However, because they 
have attempted to be systematic, 
they have included discussions of 
matters of importance to scientific 
method which have received little 
attention in similar texts in the past. 
With a bit more daring—or, perhaps, 
more space—they might have ven- 
tured to present more fully the logical 
bases of different patterns of research 
design. They perform a valuable 
Service in discussing the basic as- 
sumptions and logical requirements 
that underlie the research techniques 
they are discussing. Problems of ob- 
Jective observation are considered in 
terms of the processes of hypothesis 
construction and of research design 
Which precede them in the sequence 
of steps through which the investi- 
gator must pass. Systematic observa- 
tion involves some form of measure- 
ment no matter how relatively crude 
it may be, and it is encouraging to 
find that a discussion of the theory of 
Social measurement leads to an em- 
pus on the valuable roles which 
eric methods of analysis 
À Y play in analyzing data of the 
ES Which most measuring instru- 

ents in the social sciences now 
provide, 

ne “Skea coverage of the 
NR ich characterizes this text 
e " through the final chapter, 
of the eric poige out that the role 
Ende: aer frequently does 
me with the completion of a re- 

ch project. Questions of applying 
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research findings arise, bringing with 
them problems of a technological 
nature which are themselves in need 
of research. Since little knowledge 
regarding such problems is now avail- 
able, the best that can be done is to 
discuss experiences in this branch of 
psychotechnology and to make a case 
for including such considerations in 
future research and training pro- 
grams. 

Research Methods in the Behavioral 
Sciences stands up well under the 
criticisms that are frequently leveled 
against texts to which a number of 
authors have contributed. The ex- 
cellent integration of its thirteen 
chapters suggests that very careful 
and detailed planning must have pre- 
ceded the actual writing. There is 
much less duplication of material 
than is usually found in such col- 
laborative endeavors. Although the 
results of a Flesch count—old style 
or new—are not available, the gen- 
eral level of readability appears to be 
good and within the grasp of ad- 
vanced undergraduate or graduate 
students. 

For their present purposes the 
authors have drawn upon “...a 
limited pool of methodological re- 
search findings, to which they have 
themselves contributed." There is 
never a doubt that the authors have 
other than the same general orienta- 
tion toward the field they are dis- 
cussing. With few exceptions they 
are affiliated at present, or have been 
in the past, with the development 
of social psychology at the Univer- 
sity of Michigan. This has advantages 
as far as the presentation of a coher- 
ent, well-integrated point of view is 
concerned, but in all probability it is 
a characteristic of the text which will 
bring comments from readers who 
would like to have seen material in- 
cluded that does not appear. 

A lesson for some to learn from a 
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thoughtful appraisal of this text is 
the unsatisfactory nature of attempts 
to define a science in terms of the 
subject matter it investigates. Psy- 
chology can claim to be a science in 
so far as psychologists employ scien- 
tific methodology, based, as it is, 
upon certain well-established prin- 
ciples of reasoning conforming to cer- 
tain strict requirements and accepting 
certain articles of faith. The authors 
have shown how the experimental 
and nonexperimental methods of 
science can be called to the aid of 
those who are interested in man's 
social behavior. This achievement 
will not come as a surprise to those 
who have followed the development 
of social psychology over the past 
number of years and know how force- 
fully and consistently Dr. Katz has 
pressed for careful attention to 
methodological problems. The pres- 
ent text illustrates that he and his 
associates have gone a long way in 
this direction, yet recognize that 
there is still a good distance to cover. 
RoGER W. RUSSELL 
University College, London 


SARASON, SEYMOUR B. Psychological 
problems in mental deficiency. (2nd 
Ed.) New York: Harper, 1953. 
Pp. x+402. $5.00. 


Dr. Sarason has written a very 
good, scholarly book. Professional 
students in the social and biological 
sciences and relatively well-informed 
laymen should find it unusually read- 
able. Some readers may react against 
arguments by assertion and the some- 
times polemic and hypercritical style, 
but these characteristics do not de- 
tract from the solid merits of the 
book. One gets the impression that 
it was written by an informed, crit- 
ical psychologist who was stimulated 
by his experience to prod us out of 
the lethargy and pessimism which 
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are so frequently found among pro- 
fessional workers in the field of men- 
tal deficiency. 

The book does not attempt a 
thorough coverage of the etiologies, 
characteristics, and treatment of the 
great variety of pathologies that are 
thrown together in this so-cálled 
field. Rather, it is most valuable for 
its points of view, the discussion of 
research, and the questions raised. 
I see Sarason's book as one of the 
opening guns in a revolution which 
may overthrow or, hopefully, dis- 
member this field of mental defi- 
ciency. For, if I read Sarason right 
or without too much projection, he 
is telling us that the mental defective 
is not what he seems or what the IQ 
says he is, that not all mentally de- 
fectivechildren are mentally defective, 
and that many things can and should 
be done about their problems. In 
other words, the measured intellect- 
ual level is not the problem but is 
only a symptom. Sarason's emphases 
upon cultural problems and the ef- 
fects of environmental stimulation 
and psychotherapy are important 
and sound. 

The net result of Sarason's argu- 
ments should be the recognition that 
the field consists of many special 
pathologies, organic, genetic, and 
psychogenic, which should be diag- 
nosed in terms of etiology and prog- 
nosis under appropriate treatment 
rather than in terms of intelligence 
test scores. He fails to make a com- 
plete break with the past, however; 
and the reader may get the impres- 
sion that the term mental deficiency 
still somehow describes a meaningf 
category or homogeneous group. | 

The frequent use and implications 
of the term "garden variety defec- 
tive" bother me. The term is used to 
refer both to cases of inherited defi- 
ciency and to psychogenic or cultur- 


L 


BOOK REVIEWS 


ally determined cases. The term is 
badly chosen as it is not at all con- 
sistent with Sarason's argument for 
individualized understanding and 
treatment, and it is rather meaning- 
less and offensive. Perhaps it is a 
quibble to question such statements 
about projective techniques as “in 
recent years the psychological pen- 
dulum seems to have swung from the 
Binet to the Rorschach as the basis 
for a diagnosis of mental deficiency” 
(p. 242). No evidence is given and 
Sarason’s discussion would seem to 
contradict it. 

The final chapters are concerned 
with the important problems of deal- 
ing with parents, the question of in- 
stitutionalization, and the inade- 
quacies of professional training. One 
can only applaud Sarason’s position 
and treatment of these problems. 
On the whole, the book will undoubt- 
edly improve the stature of profes- 
sional work in this broad but neg- 
lected field. 

KARL F. HEISER 

The Vineland Training School 


COMMITTEE ON COLORIMETRY OF THE 
Optica, Society OF AMERICA. 
The science of color. New York: 
Thomas Y. Crowell, 1953. Pp. 
xiii4-385. $7.00. 


The principal contribution of this 
i k may lie not so much in the 
actual data presented as in the ex- 
on of "color" as a psycho- 
E concept. This concept is of 
ti E importance and broad applica- 
E Es it will undoubtedly have 
2 E e influence on scientific 
R » ures in other research fields 
it e special senses, Even if it took 
i E to hammer out this basic 
N E e results may compensate for 
tion erwise disappointing publica- 


America’s foremost specialists in 


color collaborated in producing this 
most authoritative book on color— 
authoritative but inhomogeneous. 
Perhaps the work should have been 
presented as “The Present Status of 
Color as Viewed by Twenty-three 
Specialists” rather than as a defini- 
tive work. It is still an interim report, 
a combination of textbook, hand- 
book, essay, and committee paper. 
It covers the principal phases of 
color: physical, psychophysical, and 
psychological; it gives a miniature 
history of optics, a one-reel trave- 
logue through ancient civilizations, 
and the usual anatomical digest. 
The rapid run-through of instru- 
ments, methods, and standards is 
given with an economy suggesting 
that the bell is about to ring and the 
students are rustling their notebooks. 
As a book, it is marked by the lack 
of an over-all plan, unevenness, dis- 
continuity, minor errors, and many 
irritating kinks in arrangement. 
Nevertheless, it will stand on the 
right-hand bookshelf for many years 
because it has features which will not 
soon be supplanted. The references 
are comprehensive and well selected, 
especially in the psychological field. 
Professional workers will appreciate 
the most complete assortment of 
colorimetric tables that has yet ap- 
ed in one volume, Its major 
offering, “The Concept of Color," is 
thoroughly expounded for the reader 
who is willing to make the effort, 
though the material is scattered in 
the Introduction and parts of Chap- 
ters 2 and 7. The diagram in Figure 
5 is invaluable to this explanation and 
might well have been used as the 
frontispiece of the book. The glos- 
sary-index will be most useful to 
visitors from other scientific fields. 
Excellent color plates help to clarify 
the psychological organization of 
color sensations into three dimen- 
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sions. The large, diffuse field of color 
phenomena has been painstakingly 
screened and organized in the two 
chapters on “Psychological Con- 
cepts," though it is very difficult for 
the reader to find his way through 
such diverse material without a spe- 
cial table of contents. 

If the book has failed to reach its 
goal, we know it is not for lack of 
prodigious effort on the part of a 
number of the editors, for lack of 
erudition, or for lack of physical re- 
sources. Perhaps the difficulty lies 
in the times. Our best scientists are 
pushed at top speed on their jobs, 
and there is less time than formerly 
for the repose in which to reflect on 
the broader aspects of a part-time, 
unpaid assignment which must be 
taken out of family and social life. 
Also, all scientific fields have ex- 
panded so rapidly that it has become 
increasingly difficult for specialists 
to encompass them editorially. It is 
interesting to consider what form the 
book might have if the editor of the 
Scientific Monthly, for example, were 
asked to plan and supervise the next 
edition. 

It is curious that a group of scien- 
tists with international scope should 
write with such an insular view- 
point. This is strictly “the American 
science of color." "There is no sug- 
gestion here of the enlightening the- 
oretical approach to color space de- 
veloped by Schrodinger and Bouma, 
or of current developments on the 
Continent. Nor would the reader 
suspect the major contributions 
which have been made in psycho- 
physical research methods by the 
British in their several institutions. 

The neglect of the British field of 
research emphasizes an omission of 
fundamental importance in this as 
well as in most other recent books on 
color. Much space is devoted to 
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methods, formulae, tables, and sta- 
tistics for colorimetric work; but 
scarcely a paragraph can be found on 
the corresponding methodology and 
tools of experimental psychology. 
The superb instrumentation which 
has been developed by the physical 
and electronic technician has made it 
possible for anyone with a few thou- 
sand dollars to lay claim to be a 
“colorist.” This can easily lead to the 
dangerously naive viewpoint that 
psychological methodology is un- 
necessary or perhaps may be expected 
to ‘‘just come naturally." 

A book which is designed to fill 
such an important need deserves a 
future edition in which the psycho- 
physical and psychological sections 
are strengthened to balance the 
physical, in which the pertinency of 
other sections is reviewed, the in- 
clusion of now-omitted sciences pro- 
posed—and with the whole plan su- 
pervised by a professional editor-in- 
charge. 

DEAN FARNSWORTH 

Visual Engineering Section, 

U. S. Naval Submarine Base, 
New London 


Moreno, J. L. Who shall survive? 
New York: Beacon House, 1953. 
Pp. cxiv+763. $10.00. 


This is a very large book. It has 
over 100 pages of preface before it 
gets to page 1. Afterwards it still 
runs to over 700 pages. In the long 
preface the author tells us that: 

1. Sociometry is a religion as well 
as a science (e.g., page xv): “I triet 
to do through sociometry what reli- 
gion without science has failed to ac 
complish in the past... . ” 

2. The author is a genius (€^ 
page xxxvii): *All my books (nine 
published between 1919 and 1925, 
were anonymous. The natural state 
of genius is anonymity... 
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Words of the Father 1 wrote with red 
ink on the walls of an Austrian 
castle.” 

3. Sociometry is the final develop- 
ment (e.g., page Ixvii): "Actually, I 
have written two bibles, an old testa- 
ment and a new testament, The 
Words of the Father and Who Shall 
Survive?” 

4. Psychoanalysis is finished (e.g., 
page liv): “... the psychoanalytic 
system was stillborn to start with. 
... The psychoanalytic couch has be- 
come a piece of furniture in the socio- 
dynamic field of the psychodramatic 
stage.” 

5. Group therapy, group dynamics, 
and, it seems, all work on small 
groups and large groups are develop- 
ments of, or secessions from, the 
sociometric movement (pages lxix, 
lxx, and xcvii to cviii). 

The body of the text is devoted to 
an exposition of the theory, methods, 
and accomplishments of sociometry. 
While it is difficult to give a brief re- 
view of over 700 pages, I will try to 
list the major points which are made. 

1. The most important concepts of 
the sociometric system are spon- 
taneity, creativity, Tele, the social 
atom, and underlying social net- 
works. There is considerable discus- 
sion which conveys to the reader a 
vague feeling concerning the meaning 
of these terms. There are no very 
specific or rigorous definitions. 
po The basic techniques of soci- 
m are psychodrama, socio- 
b a, role-playing, group psycho- 
Gay’ and the sociometric test. 
aad the discussion of the last of 

e is at all elaborated in detail. 
dor = is little more than brief men- 

ie the others. The only empirical 
than presented concern the socio- 

Tic test, the essence of which is 


aski i 
prng people with whom they want 
9 associate, 
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3. A major portion is devoted to 
detailed description of two studies 
using this sociometric test. Data are 
presented to show that the distribu- 
tion of choices is different from 
chance, the pattern of choices 
changes with age, attractions among 
persons are not always in accordance 
with formal or existing associations, 
and that if groups are made up in ac- 
cordance with the desires of the mem- 
bers, these groups hold the members’ 
interest better. 

To this reviewer it seems that there 
are many pages devoted to stating 
how important sociometry is. There 
is, however, little space devoted to 
any convincing empirical demonstra- 
tion that these claims are valid. 

LEON FESTINGER. 

University of Minnesota. 


Morse, Nancy C. Satisfaction in 
the white-collar job. Ann Arbor: 
Univer. of Michigan, 1953. Pp. 
viii4-235. $3.50. 


It has long been part of the folk- 
lore of industry that a satisfied em- 
ployee is a productive employee. 
Recent investigations of the Michi- 
gan Survey Research Center have 
questioned this assumption. The in- 
dication that there is no one-to-one 
relationship between satisfaction and 
productivity has been disturbing 
both theoretically and practically. 
Morse has attempted in this volume 
to reduce theoretical tension by the- 
ory construction. 

Satisfaction, according to Morse, is 
“the individual's perception of the 
tension-level within him" (p. 32). 
This tension-level in turn is a func- 
tion of what an individual wants and 
what he receives (environmental re- 
turn). Morse further considers the 
amount of satisfaction derived from 
a given activity as a separate variable 
from motivation, i.e, "the willing- 
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ness to expend energy in a given 
activity" (p. 11). 

Productivity is, however, a much 
more complex phenomenon to pre- 
dict than satisfaction, as it is depend- 
ent on more factors. Morse postu- 
lates at least seven. The only factor 
which productivity has in common 
with satisfaction is the strength of 
needs. Accordingly, individuals will 
experience high satisfaction and be 
high in productivity if they have 
strong needs which are “predictably” 
related to productivity and where 
environmental return from the job is 
high. If environmental return is low, 
productivity may be high but the 
amount of satisfaction will be low. 

Morse emphasizes throughout the 
volume that the hypotheses concern- 
ing the nature of satisfaction and of 
productivity and their relationship 
are tentative and will require further 
investigation to verify, modify, or re- 
fute. As an indication of their possi- 
ble validity she offers evidence from 
the analysis of interview data with 
742 clerical employees and 73 first- 
and second-line supervisors in a large 
corporation. She points out the limi- 
tations, and possibly atypical nature, 
of this population. 

Her evidence, as far as it goes, sup- 
ports her hypotheses concerning both 
satisfaction and productivity.. The 
limitations of her data, in addition to 
population limitations, seem to be 
that the investigation was not orig- 
inally planned to test hypotheses 
concerning these two variables. For 
example, her indices of job satisfac- 
tion are limited to only three meas- 
ures. It seems that data collected for 
some other purpose were analyzed 
subsequently to throw light on the 
hypotheses presented. 
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She offers no evidence for the state- 
ment that the amount of satisfaction 
is a separate variable from motiva- 


tion. In fact, her definition of moti- 
vation as well as of morale differs 
from the ones usually found in texts 
on the subject. An author has the 
privilege of defining the terms. But 
unless this serves some useful pur- 
pose, the use of familiar terms in a 
different context tends to obscure 
rather than clarify meanings. 

Her evidence suffers from the 
weaknesses inherent in the interview 
method of securing data on problems 
of motivation, morale, and satisfac- 
tion. This is acknowledged tacitly in 
the discussion of the differences found 
between employees who receive gen- 
eral supervision as contrasted with 
closely supervised employees. The 
former were less enthusiastic about 
working for the company. This 
could have arisen, according — to 
Morse, because "they maybe are in à 
situation which demands less strict 
conformity, they are more free to ex- 
press reservations and criticisms" (p. 
142). It seems that for a critical test 
of her hypotheses, interview data will 
have to be supplemented with data 
collected by other means. 1 

Morse has, however, taken time 
out from collecting data to develop 
theory. It is the opinion of this re 
viewer that too little theory develop- 
ment has characterized the investiga" 
tions psychologists have made in 10- 
dustry. The result has been a collec- 
tion of unrelated and frequently con- 
tradictory data. A real need for ad- 
vances in industrial psychology ! 
good theory making. Morse has 
taken a step in the right direction. 

WiLLIAM MCGEHEE. 

Fieldcrest Mills, Inc. 
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THE CRITICAL INCIDENT TECHNIQUE 


JOHN C. FLANAGAN 
American Institute for Research and University of Pittsburgh 


During the past ten years the 
writer and various collaborators have 
been engaged in developing and uti- 
lizinga method that has been named 
the “critical incident technique." It 
is the purpose of this article to de- 
xribe the development of this meth- 
edology, its fundamental principles, 
and its present status. In addition, 
the findings of a considerable number 

| of studies making use of the critical 
incident technique will be briefly re- 
Viewed and certain possible further 
Uses of the technique will be indi- 
. tated. 

A The critical incident technique con- 
| Ssts of a set of procedures for col- 
ing direct observations of human 
vior in such a way as to facili- 
fate their potential usefulness in solv- 
mg practical problems and develop- 
mg broad psychological principles. 
The critical incident technique out- 
procedures for collecting ob- 
served incidents having special signifi- 
E and meeting systematically 

ned criteria, 
, Y an incident is meant any ob- 
*rvable human activity that is suf- 
m tly complete in itself to permit 
erences and predictions to be made 
Tobe the person performing the act. 
i. critical, an incident must occur 
. 3 Situation where the purpose or 
t of the act seems fairly clear to 
observer and where its conse- 
ces are sufficiently definite to 


leave little doubt concerning its 
effects. 

Certainly in its broad outlines. and 
basic approach the critical incident 
technique has very little which is. 
new about it. People have been mak- 
ing observations on other people for 
centuries. The work of many of the 
great writers of the past indicates 
that they were keen observers of their 
fellow men. Some of these writers 
must have relied on detailed notes 
made from their observations. Others 
may have had unusual abilities to 
reconstruct memory images in vivid 
detail. Some may have even made a 
series of relatively systematic obser- 
vations on many instances of a par- 
ticular type of behavior. Perhaps 
what is most conspicuously needed to 
supplement these activities is a set 
of procedures for analyzing and syn- 
thesizing such observations into a 
number of relationships that can be 
tested by making additional observa- 
tions under more carefully controlled 
conditions. 


BACKGROUND AND EARLY 
DEVELOPMENTS 


The roots of the present procedures 
can be traced back directly to the 
studies of Sir Francis Galton nearly 
10 years ago, and to later develop- 
ments such as time sampling studies 
of recreational activities, controlled 


observation tests, and anecdotal rec- 
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ords. The critical incident technique 
as such, however, can best be re- 
garded as an outgrowth of studies in 
the Aviation Psychology Program of 
the United States Army Air Forces 
in World War II. The Aviation Psy- 
chology Program was established in 
the summer of 1941 to develop pro- 
cedures for the selection and classifi- 
cation of aircrews. 

One of the first studies (40) carried 
out in this program was the analysis 
of the specific reasons for failure in 
learning to fly that were reported for 
1,000 pilot candidates eliminated 
from flight training schools in the 
summer and early fall of 1941. The 
basic source used in this analysis was 
the proceedings of the elimination 
boards. In these proceedings the 
pilot instructors and check pilots 
reported their reasons for eliminating 
the particular pilot. It was found 
that many of the reasons given were 
clichés and stereotypes such as “lack 
of inherent flying ability" and “in- 
adequate sense of sustentation," or 
generalizations such as "unsuitable 
temperament," ‘poor judgment," or 
"insufficient progress." However, 
along with these a number of specific 
observations of particular behaviors 
were reported. This study provided 
the basis for the research program on 
selecting pilots. Although it was 
found very useful, it also indicated 
very clearly the need for better pro- 
cedures for obtaining a representative 
sample of factual incidents regarding 
pilot performance. 

A second study (13), which empha- 
sized the importance of factual re- 
ports on performance made by com- 
petent observers, was carried out in 
the winter of 1943-1944 in the 8th, 
9th, 12th, and 15th Air Forces. This 
study collected the reasons for the 
failures of bombing missions as re- 
ported in the Group Mission Reports. 


Although in the preparation of these 
reports much greater emphasis was 
given to determining the precise facts 
in the case, it was apparent that in 
many instances the official reports 
did not provide a complete record of 
all the important events. Even with 
these limitations, the information 
given was found to be of considerable 
value, and the systematic tabulations 
that were prepared provided the basis 
for a series of recommendations that 
resulted in important changes in Air 
Force selection and training proce- 
dures. 

In the summer of 1944 a series of 
studies (74) was planned on the prob- 
lem of combat leadership in the 
United States Army Air Forces. 
These represent the first large-scale, 
systematic effort to gather specific 
incidents of effective or ineffective 
behavior with respect to à designated 
activity. The instructions asked the 
combat veterans to report incidents 
observed by them that involved be- 
havior which was especially helpful 
or inadequate in accomplishing the 
assigned mission. The statement 
finished with the request, “Describe 
the officer's action. What did he do? 
Several thousand incidents were cor- 
lected in this way and analyzed to 
provide a relatively objective an 
factual definition of effective combat 
leadership. The resulting set of de- 
scriptive categories was called the 
“critical requirements" of combat 
leadership. à 

Another study (74) conducted in 
the Aviation Psychology Program 1m- 
volved a survey of disorientation 
while flying. Disorientation 1n this 


! This study was planned by Paul M. PS 
Jr., who also contributed to the pe 
mentioned USAAF studies and plann ip 
carried out the interview study with pilots u 

trumente 


scribed below on the design of ins 
controls, and arrangements. 


THE CRITICAL INCIDENT TECHNIQUE 


study was defined to include any ex- 


perience denoting uncertainty as to 
one's spatial position in relation to 
the vertical. In this study pilots 
returning from combat were asked 
"to think of some occasion during 
combat flying in which you personal- 
ly experienced feelings of acute dis- 
orientation or strong vertigo." They 
were then asked to describe what they 
"saw, heard, or felt that brought on 
the experience." This study led to a 
number of recommendations regard- 
ing changes in cockpit and instru- 
ment panel design and in training in 
order to overcome and prevent verti- 
go while flying. 

i In a project carried out in the Avia- 
tion Psychology Program in 1946, 
Fitts and Jones (12) collected descrip- 
tions of specific experiences from 
pilots in taking off, flying on instru- 
ments, landing, using controls, and 
using instruments. These interviews 
with pilots were electrically recorded. 
They provided many factual inci- 
dents that were used as a basis for 
planning research on the design of 
instruments and controls and the 
ge of these within the cock- 
it. 

In addition to the collection of 
Specific incidents and the formulation 
of critical requirements, as outlined 
above, the summary volume (13) for 
gr Aviation Psychology Program 
Mech Reports contained a dis- 
ussion of the theoretical basis of 
Scores for obtaining the critical 

quirements of a particular activity. 
Athana the best method of describ- 
ng the status of these procedures at 

e close of the war is to quote from 
pu cussion in this summary vol- 
sand Which was written in the late 
ii of 1946. In the section on 

iques for defining job require- 


m 
d dam the present author wrote as 
ollows: 
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The principal objective of job analysis pro- 
cedures should be the determination of critical 
requirements. These requirements include 
those which have been demonstrated to have 
made the difference between success and fail- 
ure in carrying out an important part of the 
job assigned in a significant number of in- 
stance’ Too often, statements regarding job 
requirements are merely lists of all the desira- 
ble traits of human beings. These are prac- 
tically no help in selecting, classifying, or 
training individuals for specific jobs. To ob- 
tain valid information regarding the truly 
critical requirements for success in a specific 
assignment, ures were developed in the 
Aviation Psychology Program for making sys- 
tematic analyses of causes of good and poor 
performance. 

Essentially, the procedure was to obtain 
first-hand reports, or reports from objective 
records, of satisfactory and unsatisfactory exe- 
cution of the task assigned. The cooperating 
individual described a situation in which suc- 
cess or failure was determined by specific re- 
ported causes. 

This procedure was found very effective in 
obtaining information from individuals con- 
cerning their own errors, from subordinates 
concerning errors of their superiors, from su- 
pervisors with respect to their subordinates, 
and also from participants with respect to co- 
participants (13, pp. 273-274). 


DEVELOPMENTAL STUDIES AT 
THE AMERICAN INSTITUTE 
FOR RESEARCH 


At the close of World War II some 
of the psychologists who had par- 
ticipated in the USAAF Aviation 
Psychology Program established the 
American Institute for Research, a 
nonprofit scientific and educational 
organization. The aim of this organ- 
ization is the systematic study of 
human behavior through a coordi- 
nated program of scientific research 
that follows the same general princi- 
ples developed in the Aviation Psy- 
chology Program. It was in connec- 
tion with the first two studies under- 
taken by the Institute in the spring 
of 1947 that the critical incident 
technique was more formally de- 
veloped and given its present name. 

These studies were natural exten- 
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sions of the previous research in the 
Aviation Psychology Program. The 
study reported by Preston (52) dealt 
with the determination of the critical 
requirements for the work of an officer 
in the United States Air Force. In 
this study, many of the procedural 
problems were first subjected to 
systematic tryout and evaluation. 
Six hundred and forty officers were 
interviewed, and a total of 3,029 crit- 
ical incidents were obtained. This 
led to the development of a set of 58 
critical requirements classified into 
six major areas. The second study, 
reported by Gordon (27, 28), was 
carried out to determine the critical 
requirements of a commercial airline 
pilot. In this study, several different 
sources were used to establish the 
critical requirements of the airline 
pilot. These included training rec- 
ords, flight check records including 
the specific comments of check pilots, 
critical pilot behaviors reported in 
accident records, and critical inci- 
dents reported anonymously in in- 
terviews by the pilots themselves. 
From this study, 733 critical pilot 
behaviors were classified into 24 criti- 
cal requirements of the airline pilot's 
job. These were used to develop selec- 
tion tests to measure the aptitudes 
and other personality characteristics 
found critical for success in the job. 
They also provided the basic data 
for the formulation of an objective 
flight check to determine the eligi- 
bility of applicants for the airline 
transport rating. 

The third application of the critical 
incident technique by the staff of the 
American Institute for Research was 
in obtaining the critical requirements 
for research personnel on a project 
sponsored by the Psychological Sci- 
ences Division of the Office of Naval 
Research. In this study (20), about 
500 scientists in 20 research labora- 
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tories were interviewed. These sci- 
entists reported more than 2,500 
critical incidents. The critical be- 
haviors were used to formulate induc- 
tively a set of 36 categories, which 
constitutes the critical requirements 
for the effective performance of the 
duties of research personnel in the 
physical sciences. This initial study 
provided the basis for the develop- 
ment of selection tests, proficiency 
measures, and procedures for evaluat- 


ing both job performance ‘and the 
research report. ni 


Another project undertakerfby the 
American Institute for Research in 
the spring of 1948 provided valuable 
experience with the critical incident 
technique. This study, reported by 
Nagay (48), was done for the Civil 
Aeronautics Administration under 
the sponsorship of the Committee on 
Aviation Psychology of the National 
Research Council. It was concerned 
with the air route traffic controller's 
job. One of the innovations in this 
study was the use of personnel of the 
Civil Aeronautics Administration 
who had no previous psychological 
training in collecting critical inci- 
dents by means of personal inter. 
views. In previous studies all suc 
interviewing had been conducted | 
psychologists with extensive training 
in such procedures. In this study, 
aeronautical specialists from each 0 
the seven regions conducted the inter? 
views in their regions after 4 brie 
training period. An interesting find- 


f 


ing from this study was the clear re- — 


flection of seasonal variations in fly- 
ing conditions in the types of inci- 
dents reported. The study ba 
demonstrated the selective recall o 
dramatic or other special types ° 
incidents. 
noticeable in the incidents reporte 
several months after their occur- 
rence. The incidents obtained in this 


This bias was especially | 
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study were used to develop proce- 
dures for evaluating the proficiency of 
air route traffic controllers and also 
for developing a battery of selection 
tests for this type of personnel. 

In the spring of 1949 the American 
Institute for Research undertook a 
study to determine the critical job 
requirements for the hourly wage 
employees in the Delco-Remy Divi- 
sion of the General Motors Corpora- 
tion. This study, reported by Miller 
and. Flanagan (46), was the first 
apliation of these techniques in an 
industrial situation. Foremen who 
were members of a committee ap- 
pointed to develop employee evalua- 
tion procedures collected 2,500 criti- 
cal incidents in interviews with the 
other foremen in the plants. On the 
basis of these data a form was pre- 
pared for collecting incidents on a 

y-to-day basis as a continuous 

cord of job performance. 

Using this form, the Performance 
Record for Hourly Wage Employees 
(21), three groups of foremen kept 
Tecords on the performance of their 
employees for a two-week period. A 
group of 24 foremen recorded inci- 
nts daily; another group of 24 
remen reported incidents at the 
end of each week; and a third group 
Containing the same number of fore- 
YR reported incidents only at the 
*n of the two-week period. The 
ES groups of foremen represented 
$i parable conditions of work and 
oo The foremen reporting 

ly reported 315 critical incidents; 
oe reporting weekly, 155 
is ents; and the foremen reporting 
Bore at the end of two weeks 
ks ed 63 incidents. Thus, foremen 
ie ahaa only at the end of the 

hs io forgotten approximately 

nm [7 incidents. they would 

plan re under a daily reporting 
. € foremen who reported 
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only at the end of the two-week peri- 
od appeared to have forgotten 80 
per cent of the incidents observed. 
Although it is possible that the find- 
ings may be partially attributed to 
the fact that the foremen making 
daily records actually observed more 
critical incidents because of the daily 
reminder at the time of recording, it 
is clear that much better results can 
be expected when daily recording is 
used. 

Another analysis based on data 
collected at the Delco-Remy Division 
compared the number of critical in- 
cidents of various types obtained 
from interviews with those recorded 
daily by the foremen on the perform- 
ance record. Although there were 
some differences in the relative fre- 
quencies for specific categories, the 
general patterns appeared to be quite 
similar. These results suggest that 
critical incidents obtained from in- 
terviews can be relied on to provide 
a relatively accurate account of job 
performance if suitable precautions 
are taken to prevent systematic bias. 

In addition to the development of 
the performance record described 
above, the critical incidents collected 
in this study were used as the basis 
for constructing selection tests cover- 
ing both aptitude (18) and attitude 
(2) factors. 


STUDIES CARRIED OUT AT THE 
UNIVERSITY OF PITTSBURGH 


A substantial number of studies 
have been carried out in the depart- 
ment of psychology at the University 
of Pittsburgh by students working 
for advanced degrees under the au- 
thor's direction. Mostof these studies 
had as their objective the determina- 
tion of the critical requirements for 
a specific occupational group or ac- 
tivity. Many of them also included 
contributions to technique. ` In 1949 
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Wagner (66) completed a dissertation 
on the critical requirements for den- 
tists. In this study, critical incidents 
were obtained from three sources: 
patients, dentists, and dental school 
instructors. The incidents were clas- 
sified into four main aspects of the 
dentist's job: (a) demonstrating 
technical proficiency; (b) handling 
patient relationships; (c) accepting 
professional responsibility; and (d) 
accepting personal responsibility. As 
might be expected, the patients did 
not report as large a proportion of 
incidents for demonstrating technical 
proficiency or accepting professional 
responsibility as did the other two 
groups, and the instructors reported 
only a relatively small proportion of 
their incidents in the area of handling 
patient relationships. 

On the basis of the findings from 
this study, a battery of selection tests 
was developed for use by the Uni- 
versity of Pittsburgh School of Den- 
tistry. A number of proficiency tests 
for measuring ability with respect 
to certain of the critical requirements 
were also developed using these re- 
sults as a basis. 

Another dissertation completed in 
1949 was Finkle’s (11) study of the 
critical requirements of industrial 
foremen. This study was conducted 
in the East Pittsburgh plant of the 
Westinghouse Electric Corporation. 
Critical incidents were obtained from 
foremen, general foremen, and staff 
personnel. A number of points per- 
taining to technique were studied. 

One finding was in reference to the 
effect on the types of incidents ob- 
tained of the degree of importance 
or exceptionalness set up as a criterion 
for reporting or ignoring incidents. 
The incidents obtained from the use 
of questions that asked for incidents 

only slightly removed from the norm 
were compared with incidents ob- 
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tained from questions intended to 
elicit more definitely effective or in- 
effective behaviors. Some examples 
of these questions are: 

1. Think of a time when a foreman has done 
something that you felt should be encouraged 
because it seemed to be in your opinion an ex» 
ample of good foremanship. (Effective— 
slight deviation from norm.) 

2. Think of a time when a foreman did 
something that you thought was not up 
to par. (Ineffective—slight deviation from 
norm.) 

3. Think of a time when a foreman has, in 
your opinion, shown definitely good foreman- 
ship—the type of action that points out the 
superior foreman. (Effective—substantial de- 
viation from the norm.) 2 

4. Think of a time when a foreman has, in 
your opinion, shown poor foremanship— 
sort of action which if repeated would indicate 
that the man was not an effective foreman. 
(Ineffective—substantial deviation from 
norm.) 

The frequencies of incidents ob- 
tained in each of the 40 categories 
into which the effective behaviors 
were classified were compared for 
the questions requesting slight and 
substantial deviations from the norm, 
and the significance of the differences 
was tested by means of the chi- 
square test. Two of the differences 
were significant at the 1 per cent 
level and one at the 5 per cent level. _ 
Comparisons of the frequencies In 
each of the 40 categories for ineffec- 
tive incidents failed to reveal any 
chi squares significant at either the 
5 per cent or the 1 per cent level. 

The questions involving only à 
slight deviation from the norm re- 
sulted in more effective incidents 
concerned with gaining the respect 
and loyalty of the workers and also 
in. more incidente that involved 
making, encouraging, and accepting 
suggestions, They produced signi- 
cantly fewer incidents regarding t 
ting men to jobs. The small num 
of significant differences—only FO 
in 80 comparisons—suggests that t 


is of incidents obtained are not 
fy greatly changed by variations 
wording of the questions compa- 
ble to those shown above. It seems 
‘that this is at least partially 
the fact that the persons in- 
ewed report only incidents that 
esent a fairly substantial devia- 
‘from the norm regardless of 
precise wording of the question 
ther comparison made in this 
related to the influence of ask- 
or an effective or an ineffective 
entfirst. About 10 per cent more 
nts were obtained from book- 
questing effective incidents first 
rom booklets requesting in- 
ective incidents first. This differ- 
"was sufficiently small so that 
uld reasonably be attributed to 
ce sampling fluctuations. 

e incidents collected in this 
y were used, along with other 
in the preparation of a Perform- 
Record for Foremen and Super- 
s (23). 

‘Study was conducted by Nevins 
Lon the critical requirements of 
okkeepers in sales companies. She 
ted incidents relating to appli- 
for bookkeeping positions as 
as for employees working in this 
ity. 

For the collection of the informa- 
m about the practicing bookkeep- 
» à modification in the critical inci- 
technique was made. This was 
because, in the bookkeeping 
ession, success and failure are 
ually defined in terms of persistent 
Vioral patterns. Occasional mis- 
in adding and balancing ac- 
ts are expected, but repeated 
Sare considered serious. Instead 
the single incident, therefore, 
of the items included repre- 
either a pattern of behaviors or 
les of similar behaviors. 
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Weislogel (72) determined the crit- 
ical requirements for life insurance 
agency heads. A principal feature 
of his study related to the compari- 
son of two types of agency heads— 
managers and general agents. It was 
believed. that the critical behaviors 
for one type of agency head might 
provide a different pattern than that 
obtained for the other. This hypothe- 
sis was not confirmed by the analysis 
of the obtained incidents. The pat- 
terns of critical requirements were 
found to be quite similar for the two 
types of administrators. 

Smit (58) carried out a study to 
determine the critical requirements 
for instructors of general psychology 
courses. Perhaps the finding of most 
general importance in this study was 
the existence of substantial differ- 
ences between the patterns of critical 
incidents reported by students and 
faculty. The faculty reported a signif- 
icantly larger percentage of effective 
behaviors in the following areas: 
giving demonstrations or experi- 
ments, using discussion group tech- 
niques, encouraging and ascertaining 
students’ ideas and opinions. 

The students, on the other hand, 
contributed a larger percentage of 
behaviors in the following areas: re- 
viewing examinations, distributing 
grades, and explaining grades; using 
lecture aids such as drawings, charts, 
movies, models, and apparatus; using 
project techniques; giving test ques- 
tions on assigned material; helping 
students after class and during class 
recess; the manner of the instructor. 

The faculty reported a larger per- 
centage of ineffective behaviors con- 
cerning maintaining order. The in- 
effective behaviors that were reported 
ina larger percentage by students in- 
volved these areas: presenting re- 
quirements of the course, using effec- 
tive methods of expression, dealing 
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with students' questions, pointing 
out fallacies, reviewing and summa- 
rizing basic facts and principles, using 
project techniques, using verbal diag- 
nostic teaching techniques, achieve- 
ment testing students on assigned 
material, objective type achievement 
testing, using humor. 

This is a good illustration of the 
problem of the competence of various 
types of available observers to evalu- 
ate the contribution to the general 
aim of the activity of a specific action. 
Examination of the reports from stu- 
dents indicated a somewhat limited 
sphere of competence. Apparently 
one of the principal reasons for this 
was the lack of perspective on the 
part of the students and their inabil- 
ity to keep the general aim of the 
instructor clearly in mind because of 
its divergence from their own im- 
mediate aims, In many cases, this 
latter aim seemed to be directed 
toward achieving a satisfactory grade 
in the course. 

Eilbert (7) developed a functional 
description of emotional immaturity. 
The contributors of critical incidents 
included psychiatrists, psychologists, 
psychiatric social workers, occupa- 
tional therapists, nurses, and corps- 
men from a military hospital, plus 
13 psychologists in nonmilitary or- 
ganizations. The subjects of the in- 
cidents were primarily patients under 
psychiatric care. 

The contributors were given a form 
that oriented them to the concept 
"emotional immaturity’ by sug- 
gesting that it was revealed generally 
by childlike modes of behavior. The 
questions used to elicit incidents 
were: Have you recently thought of 
someone as being emotionally im- 
mature (regardless of diagnosis)? 
What specifically happened that gave 
you this impression? What would 
have been a more mature reaction to 
the same situation? 
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Because of the indefinite nature of 
the concept, it was felt that a check 
should be made on the contributor's 
understanding of his task. Twenty of 
the participating persons were asked 
to summarize briefly their interpreta- 
tion of what they had been asked to 
do. This appeared to be very useful 
in developing the phrasing of the 
questions so that they were uniformly 
interpreted by the various observers. 

The author of the study classified 
all the immaturities on the basis of a 
classification system developed from 
preliminary categorizations prepared 
by six of the contributors. This clas- 
sification was submitted to 14 psychi- 
atrists for review. They were asked 
to indicate which of the categories 
they were willing to accept as a type 
of immaturity as the term had been 
defined in an official document. 
More than half the categories were 
accepted by at least 13 of the 14 
judges, and none was rejected by more 
than 50 per cent of the judges. It was 
felt then that the system was accept- 
able. J 

This study illustrates the applica- 
tion of the critical incident technique 
to the study of personality. It is 
believed that this study provides an 
excellent example of the possibilities 
for developing more specific behavi- 
oral descriptions. Mr 

Folley (24) reported on the critical 
requirements of sales clerks in depart- 
ment stores. The behaviors Were 
abstracted from narrative records 0 
individual shopping incidents written 
by shoppers who were relatively 1m- 
experienced in evaluating sales per- 
sonnel. For various reasons, including 
the competence of the observers 
their training, and their limited point 
of view, the resulting description 
must be regarded as only partial. 


In the past few years, many other - 


have made 


individuals and groups 1 
descri 


use of the techniques 
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above, or modifications of them, in a 
wide variety of studies. Some of these 
studies on which reports are being 
published will be reviewed briefly in 
the section on applications. 


THE PROCEDURE IN Its 
PRESENT FORM 


From the foregoing discussion, it is 
clear that the critical incident tech- 
nique is essentially a procedure for 
gathering certain important facts 
concerning behavior in defined situa- 
tions. It should be emphasized that 
the critical incident technique does 
not consist of a single rigid set of 
rules governing such data collection. 
Rather it should be thought of as a 
flexible set of principles which must 
be modified and adapted to meet the 
specific situation at hand. 

The essence of the technique is 
that only simple types of judgments 
are required of the observer, reports 
from only qualified observers are 
included, and all observations are 
evaluated by the observer in terms of 
an agreed upon statement of the pur- 
pose of the activity. Of course, sim- 
plicity of judgments is a relative 
matter. The extent to which a re- 
ported observation can be accepted 
as a fact depends primarily on the 
objectivity of this observation. By 
Objectivity is meant the tendency for 
à number of independent observers 
to make the same report. Judgments 
that two things have the same effect 
or that one has more or less effect 
than the other with respect to some 
Rod purpose or goal represent the 
implest types of judgments that can 
h made. The accuracy and therefore 

€ objectivity of the judgments de- 
Ed on the precision with which the 

aracteristic has been defined and 
Bite competence of the observer in 
tio Tpreting this definition with rela- 
tà to the incident observed. In this 
ter process, certain more difficult 
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types of judgments are required re- 
garding the relevance of various con- 
djtions and actions on the observed 
success in attaining the defined pur- 
pose for this activity. 

It is believed that a fair degree of 
success has been achieved in develop- 
ing procedures that will be of assist- 
ance in gathering facts in a rather 
objective fashion with only a mini- 
mum of inferences and interpreta- 
tions of a more subjective nature. 
With respect to two other steps that 
are essential if these incidents are to 
be of value a comparable degree of 
objectivity has not yet been ob- 
tained. In both instances, the sub- 
jective factors seem clearly due to 
current deficiencies in psychological 
knowledge. 

The first of these two other steps - 
consists of the classification of the 
critical incidents. In the absence of 
an adequate theory of human be- 
havior, this step is usually an in- 
ductive one and is relatively subjec- 
tive. Once a classification system has 
been developed for any given type of 
critical incidents, a fairly satisfactory 
degree of objectivity can be achieved 
in placing the incidents in the defined 
categories. 

The second step refers to inferences 
regarding practical procedures for 
improving performance based on the 
observed incidents. Again, in our 
present stage of psychological knowl- 
edge, we are rarely able to deduce or 
predict with a high degree of con- 
fidence the effects of specific selection, 
training, or operating procedures on 
future behaviors of the type observed. 
The incidents must be studied in the 
light of relevant established principles 
of human behavior and of the known 
facts regarding background factors 
and conditions operating in the spe- 
cific situation. From this total pic- 
ture hypotheses are formulated. In 
only a few types of activities are there 
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both sufficient established principles 
and sufficient information regarding 
the effective factors in the situation 
to provide a high degree of confidence 
in the resulting hypotheses regarding 
specific procedures for improving the 
effectiveness of the results. 

In the sections which follow, the 
five main steps included in the pres- 
ent form of the procedures will be 
described briefly. In order to pro- 
vide the worker with maximum flexi- 
bility at the present stage, in addition 
to examples of present best practice, 
the underlying principles for the step 
will be discussed and also the chief 
limitations with, wherever possible, 
suggestions for studies that may re- 
sult in future improvements in the 
methods. 


1. General Aims 


A basic condition necessary for any 
work on the formulation of a func- 
tional description of an activity is a 
fundamental orientation in terms of 
the general aims of the activity. No 
planning and no evaluation of specific 
behaviors are possible without a 
general statement of objectives. The 
trend in the scientific field toward 
operational statements has led a 
number of writers to try to describe 
activities or functions in terms of the 
acts or operations performed, the 
materials acted on, the situations in- 
volved, the results or products, and 
the relative importance of various 
acts and results. These analyses have 
been helpful in emphasizing the need 
for more specific and detailed de- 
scriptions of the requirements of ac- 
tivities. Typically, however, such 
discussions have failed to emphasize 
the dominant role of the general aim 
in formulating a description of suc- 
cessful behavior or adjustment in a 
particular situation. 

In its simplest form, the functional 
description of an activity specifies 
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precisely what it is necessary to do 
and not to do if participation in the 
activity is to be judged successful or 
effective. It is clearly impossible to 
report that a person has been either 
effective or ineffective in a particular 
activity by performing a specific act 
unless we know what he is expected 
to accomplish. For example, a super- 
visor's action in releasing a key 
worker for a half a day to participate 
in a recreational activity might be 
evaluated as very effective if the 
general aim of the foreman was to get 
along well with the employees under 
him. On the other hand, this same 
action might be evaluated as ineffec- 
tive if the primary general aim is the 
immediate production of materials or 
services. 

In the case of the usual vocational 
activities the supervisors can be ex- 
pected to supply this orientation. In 
certain other types of activities, such 
as civic, social, and recreational ac- 
tivities, there frequently is no super- 
visor. The objectives of participation 
in the activity must then be deter- 
mined from the participants them- 
selves. In some instances, these may 
not be verbalized to a sufficient ex- 
tent to make it possible to obtain 
them directly. k 

Unfortunately, in most situations 
there is no one general aim which is 
the correct one. Similarly, there 1$ 
rarely one person or group of persons 
who constitute an absolute, authori- 
tative source on the general aim 0 
the activity. In a typical manufac- 
turing organization the foreman, the 
plant manager, the president, and the 
stockholders might define the general 
aim of the workers in a particular 
section somewhat differently. It !5 
not possible to say that one of these 
groups knows the correct general aim 
and the others are wrong. This does 
not mean that one general aim is 45 
good as another and that it is unim- 
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portant how we define the purpose of 
the activity. It does mean that we 
cannot hope to get a completely ob- 
jective and acceptable general aim for 
a specific activity. The principal 
criterion in formulating procedures 
for establishing the general aim of the 
activity should be the proposed use 
of the functional description of the 
activity which is being formulated. 
Unless the general aim used is accept- 
able to the potential users of the de- 
tailed statement of requirements, the 
whole effort in formulating this state- 
ment will have been wasted. 

The most useful statements of aims 
seem to center around some simple 
phrase or catchword which is slogan- 
like in character. Such words provide 
a maximum of communication with 
only a minimum of possible misinter- 
pretation. Such words as *'apprecia- 
tion," "efficiency," "development," 
"production," and "service" are 
likely to be prominent in statements 
of general aims. For example, the 
general aim of a teacher in elemen- 
tary school art classes might be the 
development of an appreciation of 
various visual art forms on the part 
of the students. The general aim of 
the good citizen might be taken as 
effective participation in the develop- 
ment and application of the rules and 
procedures by which individuals and 
groups are assisted in achieving their 
various goals. 

With the aid of a form of the type 
Shown in Fig. 1, the ideas of a num- 
ber of well-qualified authorities can 
be collected. It is expected that in 
Tesponse to the question on the pri- 
Mary purpose of the activity many 
Persons will give a fairly lengthy and 
detailed statement. The request to 
ed is expected to get them to 
Ee ense this into a brief usable 
: ement. These should be pooled 
nd a trial form of the statement of 
8eneral aim developed. This state- 
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ment should be referred either to 
these authorities or to others to ob- 
tain a final statement of the general 
aim that is acceptable to them. 
Necessary revisions should be made 
as indicated by these discussions. 
Usually considerable effort is required 
to avoid defeating the purpose of the 


OUTLINE FOR INTERVIEW TO 
ESTABLISH THE GENERAL 
Arm FOR AN ACTIVITY 

1. Introductory statement: We are mak- 
ing a study of (specify activity). We 
believe you are especially well quali- 
fied to tell us about (specify ac- 
tivity). 

. Request for general aim: What would 
you say is the primary purpose of 
(specify activity)? 

. Request for summary: In a few 
words, how would you summarize 
the general aim of (specify activ- 
ity)? 


Fic. 1. SAMPLE For For USE IN OBTAIN- 
ING GENERAL AIM 


general aim by cluttering up the 
statement with specific details and 
qualifying conditions. 

In summary, the general aim of an 
activity should be a brief statement 
obtained from the authorities in the 
field which expresses in simple terms 
those objectives to which most people 
would agree. Unless a brief, simple 
statement has been obtained, it will 
be difficult to get agreement among 
the authorities. Also it will be much 
harder to convey a uniform idea to 
the participants. This latter group 
will get an over-all impression and 
this should be as close to the desired 
general aim as possible. 


2. Plans and Specifications 

To focus attention on those aspects 
of behavior which are believed to be 
crucial in formulating a functional 
description of the activity, precise 
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instructions must be given to the 
observers. It is necessary that these 
instructions be as specific as possible 
with respect to the standards to be 
used in evaluation and classification. 
The group to be studied also needs 
to be specified. 

One practical device for obtaining 
specific data is to obtain records of 
“critical incidents" observed by the 
reporting personnel. Such incidents 
are defined as extreme behavior, 
either outstandingly effective or in- 
effective with respect to attaining the 
general aims of the activity. The 
procedure has considerable efficiency 
because of the use of only the ex- 
tremes of behavior. It is well known 
that extreme incidents can be more 
accurately identified than behavior 
which is more nearly average in 
character. 

One of the primary aims of scien- 
tific techniques is to insure objectiv- 
ity for the observations being made 
and reported. Such agreement by 
independent observers can only be 
attained if they are all following the 
same set of rules. It is essential that 
these rules be clear and specific. In 
most situations the following speci- 
fications will need to be established 
and made explicit prior to collecting 
the data: 

a. The situations observed. The first neces- 
sary specification is a delimitation of the situa- 
tions to be observed. This specification must 
include information about the place, the per- 
sons, the conditions, and the activities. Such 
specifications are rather easily defined in many 
instances. For example, such brief specifica- 
tions as observations of ‘‘the behavior in class- 
rooms of regularly employed teachers in a 
specified high school while instructing students 
during class periods," constitute a fairly ade- 
quate definition of a situation of this type. 

In complex situations it is probably essen- 
tial not only that the specifications with re- 
spect to the situation be relatively complete 
and specific, but also that practical examples 
be provided to assist the observer in deciding 
in an objective fashion whether or not a spe- 
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cific behavior should be observed and re- 
corded. 

b. Relevance to the general aim. After the de- 
cision has been made that a particular situa- 
tion is an appropriate one for making observa- 
tions, the next step is to decide whether or not 
a specific behavior which is observed is rele- 
vant to the general aim of the activity as de- 
fined in the section above. For example, if the 
general aim of the activity was defined as sus- 
tained high quality and quantity of produc- 
tion, it might be difficult to decide whether or 
not to include an action such as encouraging 
an unusually effective subordinate to get train- 
ing that would assist him in developing his 
ability in an avocational or recreational activ- 
ity not related to his work. In this case, it 
might be specified that any action which either 
directly or indirectly could be expected over a 
long period of time to have a significant effect 
on the general aim should be included. If it 
could not be predicted with some confidence 
whether this effect would be good or bad, it 
should probably not be considered. 

The extent of detail required to obtain ob- 
jectivity with respect to this type of decision 
depends to a considerable degree on the back- 
ground and experiences of the observers with 
respect to this activity. For example, super- 
visors with substantial experience in a particu- 
lar company can be expected to agree om 
whether or not a particular behavior is rele- 
vant to the attainment of the general aim. 
On the other hand, if outside observers were 
to be used, it would probably be necessary to 
specify in considerable detail the activities 
that can be expected to have an effect on the 
general aim. 

c. Extent of effect on the general aim. The re- 
maining decision that the observer must make 
is how important an effect the observed inci- 
dent has on the general aim. It is necessary to 
specify two points on the scale of importance: 
(a) a level of positive contributions to the gen- 
eral aim in specific terms, preferably including 
a concrete example, and (b) the corresponding 
level of negative effect on the general aim ex- 
pressed in similar terms. A 

A definition which has been found useful is 
that an incident is critical if it makes à ` sig- 
nificant” contribution, either positively OF 
negatively, to the general aim of the activity. 
The definition of "significant" will depend on 
the nature of the activity. If the general aim 
of the activity is in terms of production, a S187 
nificant contribution might be one whi : 
caused, or might have caused, an appreciable 
change in the daily production of the depart- 
ment either in the form of an increase or a de- 
crease. 
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In certain specific situations, it might be de- 

sirable and possible to set up a quantitative 

criterion such as saving or wasting 15 minutes 

of an average worker's production. In some 

situations, a definition of significance might be 

set up in terms of dollars saved or lost both di- 
rectly and indirectly. 

Actions which influence the attitudes of oth- 
ers are more difficult to evaluate objectively. 
Perhaps the best we might be able to do is to 
state it in terms of a probability estimate. For 
example, one such criterion might be that the 
minimum critical level would be an action that 
would have an influence such that at least one 
person in ten might change his vote on an issue 
of importance to the company. 

d. Persons to make the observations. One ad- 
ditional set of specifications refers to the selec- 
tion and training of the observers who are to 
make and report the judgments outlined in the 
steps above. 

Wherever possible, the observers should be 
selected on the basis of their familiarity with 
the activity. Special consideration should be 
given to observers who have made numerous 
observations on persons engaged in the activ- 

| ity. Thus, for most jobs, by far the best ob- 
servers are supervisors whose responsibility it 
isto see that the particular job being studied is 
done. However, in some cases very useful ob- 
servations can be contributed by consumers of 
the products and services of the activity. For 

. example, for a study of effective sales activi- 
ties, the customers may have valuable data to 
Contribute. For a study of effective parental 
activity, the children may be able to make val- 
uable contributions. 

In addition to careful selection of the per- 
Sons to make observations, attention should 

given to their training. Minimal training 
Should include a review of the nature of the 
general aim of the activity and a study of the 
Specifications and definitions for the judg- 

‘ ments they will be required to make. Where 
the situation is complex or the observers are 
not thoroughly familiar with the activity, su- 
Sere practice in applying these definitions 

ould be provided. This can be done by pre- 
eed descriptions of observations and asking 

observers to make judgments about these 
Paw Their judgments can be immedi- 

y confirmed or corrected during such su- 
Petvised practice periods. 
zi Fig. 2 is shown a form for use in develop- 
Be nations regarding observations. The 
E form in making plans for the collec- 
i. f critical incidents or other types of ob- 

‘ational data should aid in objectifying 

ese specifications. 
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3. Collecting the Data 


If proper plans and specifications 
are developed, the data collection 
phase is greatly simplified. A neces- 
sary condition for this phase is that 
the behaviors or results observed be 
evaluated, classified, and recorded 
while the facts are still fresh in the 
mind of the observer. It would be 
desirable for these operations to be 


Specifications Regarding Observations 


1. Persons to make the observations. 
a. Knowledge concerning the activ- 


ity. 
b. Relation to those observed. 
c. Training requirements. 
2. Groups to be observed. 
a. General description. 
b. Location. 
c. Persons. 
d. Times. 
e. Conditions. 
3. Behaviors to be observed. 
a. General type of activity. 
b. Specific behaviors. 
c. Criteria of relevance to general 
aim. 
d. Criteria of importance to general 
aim (critical points) 


Fic. 2. Form ror DEVELOPING SPECIFICA- 
TIONS REGARDING OBSERVATIONS 


performed at the time of observation 
so that all requisite facts could be 
determined and checked. Memory is 
improved if it is known in advance 
that the behavior to be observed is to 
be remembered, It is greatly im- 
proved if the specific aspects of what 
is to be observed are defined and if 
the operations to be performed with 
respect to evaluation and classifica- 
tion are clearly specified. 2 l 

The critical incident technique 1s 
frequently used to collect data on 
observations previously made which 
are reported from memory. This is 
usually satisfactory when the in- 
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cidents reported are fairly recent and 
the observers were motivated to 
make detailed observations and eval- 
uations at the time the incident 
occurred. 

The importance of obtaining re- 
cent incidents to insure that the in- 
cidents are representative of actual 
happenings was demonstrated in the 
study on air route traffic control- 
lers by Nagay (48) reported above. 
However, as also discussed in that 
study, in some situations adequate 
coverage cannot be obtained if only 
very recent incidents are included. 

Evidence regarding the accuracy 
of reporting is usually contained in 
the incidents themselves. If full and 
precise details are given, it can 
usually be assumed that this informa- 
tion is accurate. Vague reports sug- 
gest that the incident is not well re- 
membered and that some of the data 
may be incorrect. In several situa- 
tions there has been an opportunity 
to compare the types of incidents 
reported under two conditions (a) 
from memory and without a list of 
the types of incidents anticipated, 
and (b) those reported when daily 
observations were being made in a 
routine work situation, and the eval- 
uations and classifications were made 
and recorded on a prepared check list 
within 24 hours of the time of obser- 
vation. The results of one such com- 
parison were discussed briefly above 
in connection with the American 
Institute for Research study of fac- 
tory employees. 

During the observational period a 
negligible number of incidents were 
reported by the foremen as not fitting 
into the general headings included on 
thelist. Although the proportions of 
incidents for the various items on the 
list are not identical, they are reason- 
ably close for most of the items. 
Items on such matters as meeting 
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production requirements and accept- 
ing changes in jobs are higher in 
terms of the recorded than the re- 
called incidents. The fact that items 
such as wasting time and assisting on 
problems are lower for the recalled 
incidents suggests that part of this 
discrepancy lies in the interpretations 
of the category definitions. The 
classifying of recorded incidents was 
done by the foremen, while the classi- 
fication of the recalled incidents was 
done by the research workers. In 
fairness, it should also be noted that 
the definitions used by the research 
workers were rewritten before they 
were incorporated in the foremen's 
manuals. 

On the whole, it seems reasonable 
to assume that, if suitable precau- 
tions are taken, recalled incidents can 
be relied on to provide adequate data 
for a fairly satisfactory first approxi- 
mation to a statement of the require- 
ments of the activity. Direct obser- 
vations are to be preferred, but the 
efficiency, immediacy, and minimum 
demands on cooperating personnel 
which are achieved by using recalled 
incident data frequently make their 
use the more practical procedure. 

Another practical problem in col- 
lecting the data for describing am 
activity refers to the problem of how 
it should be obtained from the ob- 
servers. This applies especially to 
the problem of collecting recalled 
data in the form of critical incidents. 
Four procedures have been used an 
will be discussed briefly below: 


a. Interviews. The use of trained personnel 
to explain to observers precisely what data are 
desired and to record the incidents, making 
sure that all necessary details are supplied, is 
probably the most satisfactory data collection 
procedure. This type of interview is somewha 
different from other sorts of interviews an a 
brief summary of the principal factors invol 
will be given. 


(i) Sponsorship of the study. If a stranger to 
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the observers is collecting the data, it is ordi- 
narily desirable to indicate on what authority 
the interview is being held. This part should 
be as brief as possible to avoid any use of time 
fora prolonged discussion of a topic irrelevant 
to the purpose of the interview. In many in- 
stances all that needs to be said is that some- 
one known and respected by the observer has 
suggested the interview. 

(ii) Purpose of the study. This should also 
be brief and ordinarily would merely involve a 
statement that a study was being made to de- 
scribe the requirements of the activity. This 
would usually be cast in some such informal 
form as, “We wish to find out what makes a 
good citizen," or, "We are trying to learn in 
detail just what successful work as a nurse in- 
dudes." In cases where there is some hesita- 
tion about cooperating or a little more expla- 
nation seems desirable, a statement can be 
added concerning the value and probable uses 
of the results. This frequently takes the form 
of improving selection and training proce- 
dures. In some instances, it would involve im- 
proving the results of the activity. For ex- 
ample, the interviewer might say, ‘In order to 
get better sales clerks we need to know just 
what they do that makes them especially ef- 
fective or ineffective,” or, “If parents are to 
be more effective, we need to be able to tell 
them the things they do that are effective and 
ineffective,” 

(iii) The group being interviewed. If there is 
any likelihood of a person feeling, “But, why 
ask me?" it is desirable to forestall this by 
Pointing out that he is a member of a group 
which is in an unusually good position to ob- 
Serve and report on this activity. The special 
Balitcations of members of this group as ob- 
E can be mentioned briefly, as, “Super- 
i * "eua as yourself are constantly observ- 
E in evaluating the work of switchboard 
NAE or, "Students are in an unusually good 
la pon to observe the effectiveness of their 

n ers in a number of ways." 
ni] The anonymity of the data. Especially 
E^ ren of information about inef- 
E ehavior, one of the principal prob- 
m à convince the observer that his report 
Yay. Sonn the person reported on in any 
CH ually he also needs to be convinced 
that he Jap reported on will never know 
are not s portec the incident. Assurances 
actual E ien y so effective in this situation as 

ndling Uia of techniques to be used in 
o judge ud ata, which enable the observer 

Poo ce age how well the anonymity of 
stances a ^ guarded. Under no circum- 

Water | the confidences of the reportees 

in any way. The use of sealed en- 
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velopes, avoidance of identifying information, 
the mailing of data immediately to a distant 
point for analysis, and similar techniques are 
helpful in establishing the good faith of the in- 
terviewer in taking all possible precautions to 
safeguard the incidents reported. 

(v) The question. The most crucial aspect 
of the data collection procedure is the ques- 
tions asked the observers. Many studies have 
shown that a slight change in wording may 
produce a substantial change in the incidents 
reported. For example, in one study the last 
part of one of the specific questions asked was, 
“Tell just how this employee behaved which 
caused a noticeable decrease in production.” 
This question resulted in almost all incidents 
reported having to do with personality and at- 
titude behaviors. This part of the question was 
changed to, ‘Tell just what this employee did 
which caused a noticeable decrease in produc- 
tion." This second question produced a much 
broader range of incidents. To the question 
writer “how he behaved” and “what he did” 
seemed like about the same thing. To the fore- 
men who were reporting incidents “how he be- 
haved” sounded as if personality and atti- 
tudes were being studied. The subtle biases 
involved in the wording of questions are not 
always so easily found. Questions should al- 
ways be tried out with a small group of typical 
observers before being put into general use in 
a study. 

The question should usually refer briefly to 
the general aim of theactivity. This aim might 
be discussed more fully in a preliminary sen- 
tence. It should usually state that an incident, 
actual behavior, or what the person did is de- 
sired. It should briefly specify the type of be- 
havior which is relevant and the level of im- 
portance which it must reach to be reported. 
It should also tie down the selection of the in- 
cidents to be reported by the observer in some 
way, such as asking for the most recent obser- 
vation, in order to prevent the giving of only 
the more dramatic or vivid incidents, or some 
other selected group, such as those which fit 
the observer's stereotypes. 4 

An effective procedure for insuring that the 
interpretation of the persons being interviewed 
is close to that intended is to request a sample 
of persons typical of those to be interviewed to 
state in their own words what they understand 
they have been asked to do. These persons 
should be selected so as to represent all types 
who will be interviewed. Froma study of their 
interpretations, necessary revisions can be 
made to insure that all interviewees will be in 
agreement as to the nature of the incidents 


they are to provide. $ Ñ 
(vi) The conversation. The interviewer 
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should avoid asking leading questions after 
the main question has been stated. His re- 
marks should be neutral and permissive and 
should show that he accepts the observer as 
the expert. By indicating that he understands 
what is being said and permitting the observer 
to do most of the talking, the interviewer can 
usually get unbiased incidents. If the question 
does not seem to be understood, it can be re- 
peated with some reference to clarifying just 
what is meant by it. If the observer has given 
what seems like only part of the story, he 
should be encouraged by restating the essence 
of his remarks. This usually tends to encour- 
age him to continue and may result in his 
bringing out many relevant details that the in- 
terviewer did not know the situation well 
enough to ask for, In some cases, it is desira- 
ble to have the interviews recorded elec- 
trically and transcribed. This increases the 
work load substantially, and trained inter- 
viewers can usually get satisfactory reports at 
the time or by editing their notes shortly after 
the interview. 

Usually the interviewer should apply cer- 
tain criteria to the incidents while they are be- 
ing collected. Some of the more important cri- 
teria are: (a) is the actual behavior reported; 
(b) was it observed by the reporter; (c) were all 

t factors in the situation given; (d) has 
the observer made a definite judgment regard- 
ing the criticalness of the behavior; (e) has the 
observer made it clear just why he believes the 
behavior was critical. 

In Fig. 3 is shown a sample of the type of 
form used by interviewers to collect critical in- 
cidents. Of course the form must be adapted 
to the needs of the specific situation. 

b. Group interviews. Because of the cost in 
time and personnel of the individual interview, 
a group interview technique has been devel- 
oped. This retains the advantages of the indi- 
vidual interview in regard to the personal con- 
tact, explanation, and availability of the inter- 
viewer to answer questions. To some extent it 
also provides for a check on the data supplied 
by the interviewees, Its other advantages are 
that the language of the actual observer is pre- 
cisely reproduced and the time for editing the 
interviews is virtually eliminated. 

The method consists of having the inter- 
viewer give his introductory remarks to a 
group very much as he would do in an individ- 
ual interview. There is an opportunity for 
questions and clarification. Then each person 
is asked to write incidents in answer to specific 
questions contained on a specially prepared 
form. The size of the group which can be han- 
dled effectively will vary with the situation. 
If the group is fairly small, it is usually possi- 
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“Think of the last time you saw one 
of your subordinates do something that 
was very helpful to your group in 
meeting their production schedule." 
(Pause till he indicates he has such an 
incident in mind.) “Did his action re- 
sult in increase in production of as much 
as one per cent for that day?—or some 
similar period?” 

(If the answer is “no,” say) “I won- 
der if you could think of the last time 
that someone did something that did 
have this much of an effect in increasing 
production." (When he indicates he has 
such a situation in mind, say) "What 
were the general circumstances leading 


up to this incident?" 


“Tell me exactly what this person did 
that was so helpful at that time." 


———— 


"Why was this so helpful in getting 


your group's job done?” 


o i DANNES H 
“When did this incident happen?” 
i eS 


“What was this person's job?” ——— 


lo Do MTMedpRMNCREMEALOr .— 
"How long has he been on this job?" 


cac AM ELSE s 


“How old is he?" ——————— 


Fic. 3. SAMPLE or A FORM FOR USE BY AN 
INTERVIEWER IN CoLLECTING EFF 
CRITICAL INCIDENTS 


ble for the interviewer to read the responses of 
each member of the group to the first qu 1 
and make sure that he understands what 
wanted. There seems to be a certain amount 
of social facilitation, and the results in 
situations have been excellent. In the report 
of the first use of this procedure by Wagner 
(65), the amount of interviewer time require". 
per usable incident was 4.3 minutes for 
group interview procedure as com 


15.7 minutes for individual interviews. The 
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to be about thc same for the two situations. 

€. Questionnaires. If the group becomes 
large, the group interview procedure is more in 
the nature of a questionnaire procedure. 
There are, of course, all types of combinations 
of procedures that can be used. The one that 
is most different from those discussed is the 
mailed questionnaire. In situations where the 
observers are motivated to read the instruc- 
tions carefully and answer conscientiously, 
this technique seems to give results which are 
not essentially different from those obtained 
by the interview method. Except for the ad- 
dition of introductory remarks, the forms used 
in collecting critical incidents by means of 
mailed questionnaires are about the same as 
those used in group interviews. 

4. Record forms. One other procedure for 
collecting data is by means of written records. 

are two varieties of recording: one is to 

record details of incidents as they happen. 
This situation is very similar to that described 
in connection with obtaining incidents by in- 
terviews above, except that the observation 
and giving of incidents are delayed following 
the introductory remarks and the presentation 
of the questions until an incident is observed 
to happen. 

A variation of this procedure is to record 
such incidents on forms which describe most of 
the possible types of incidents by placing a 

i check or tally in the appropriate place. 

i As additional information becomes availa- 
ble on the nature of the components which 
make up activities, observers may thus collect 
data more efficiently by using forms for record 
ing and classifying observations. In the mean- 
time, because of the inadequacy of the infor- 
mation currently available regarding these 

components, it seems desirable to ask observ- 

ers to report their observations in greater de- 
tail and have the classification done by spe- 
cially trained personnel. 

Size of sample. A general problem which 
Overlaps the phases of collecting the incidents 
ànd analyzing the data relates to the number 
of incidents required. There does not appear 
to be a simple answer to this question. If the 
activity or job being defined is relatively sim- 
106 it may be satisfactory to collect only 50 or 
E Incidents. On the other hand, some types 

complex activity appear to require several 

sand incidents for an adequate statement 
requirements, 

a he most useful procedure for determining 

5 ether or not additional incidents are needed 

a to keep a running count on the number of 

ew critical behaviors added to the classifica- 
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that adequate 
when the addition of 100 critical incidents to 
the sample adds only two or three critical be- 
haviors. For jobs of a supervisory nature, it 
appears that between 2,000 and 4,000 critical 
incidents are required to establish a compre- 
hensive statement of requirements that in- 
cludes nearly all of the different types of criti- 
cal behaviors. For semiskilled and skilled jobs 
between 1,000 and 2,000 incidents seem to be 


incidents has been collected. If a relatively 
precise definition of each critical behavior cat- 
egory is required, it may be necessary to getat 
least three or four examples of each critical be- 
havior. Similarly, if the critical incidents are 
to be used as a basis for developing selection 
tests, training materials, and 

measures, more incidents may be required to 
provide a sufficient supply of usable for 
the development of these materials. 

In summary, although there is no simple 
formula for determining the number of critical 
incidents that will be required, this is a very 
important consideration in the plan of the 
study; checks should be made both on the first 
hundred or so incidents and again after ap- 
proximately half of the number of incidents 
believed to be required have been obtained in 
order to make it possible to revise the prelimi- 
nary estimates, if necessary, with a minimum 
loss in effort and time. 


4. Analysing the Data 


The collection of a large sample of 
incidents that fulfill the various con- 
ditions outlined above provides a 
functional description of the activity 
in terms of specific behaviors. If the 
sample is representative, the judges 
well qualified, the types of judgments 
appropriate and well defined, and the 
procedures for observing and report- 
ing such that incidents are reported 
accurately, the stated requirements 
can be expected to be comprehensive, 
detailed, and valid in this form. 
There is only one reason for going 
further and that is practical utility. 
The purpose of the data analysis 
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stage is to summarize and describe 
the data in an efficient manner so 
that it can be effectively used for 
many practical purposes. 

In the discussion which follows, it 
should be kept in mind that the 
process of description has been com- 
pleted. The specific procedures to be 
discussed are not concerned with im- 
proving on the comprehensiveness, 
specificity of detail, or validity of the 
statement of the requirements of the 
activity. Rather, they are concerned 
with making it easier to report these 
requirements, to draw inferences 
from them, and to compare the ac- 
tivity with other activities. 

The aim is to increase the useful- 
ness of the data while sacrificing as 
little as possible of their compre- 
hensiveness, specificity, and validity. 
It appears that there are three pri- 
mary problems involved: (a) the 
selection of the general frame of 
reference that will be most useful for 
describing the incidents; (5) the in- 
ductive development of a set of 
major area and subarea headings; 
and (c) the selection of one or more 
levels along the specificity-generality 
continuum to use in reporting the 
requirements. Each of these prob- 
lems will be discussed below: 

a. Frame of reference. There are countless 
ways in which a given set of incidents can be 
classified. In selecting the general nature of 
the classification, the principal consideration 
should usually be that of the uses to be made 
of the data. The preferred categories will be 
those believed to be most valuable in using the 
statement of requirements. Other considera- 
tions are ease and accuracy of classifying the 
data, relation to previously developed defini- 
tions or classification systems, and considera- 
tions of interpretation and reporting, which 
will be discussed in a later section. 

For job activities, the choice of a frame of 
reference is usually dominated by considera- 
tions of whether the principal use of the re- 
quirements will be in relation to selection, 
training, measurement of proficiency, or the 
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development of procedures for luati 
the-job effectiveness. For selection p 
the most appropriate classification 
psychological one. The main headings has 
do with types of psychological traits that 
utilized in the selection process. For trainin 
uses, the best classification system follows 
set of headings that is easily related to training. 
courses or broad training aims. For pro 
ficiency measurement, the headings tend to be 
similar to those for training except that th 
is less attention to possible course organi 
and aims and greater attention to the € 
ponents of the job as it is actually perfo 
For the development of procedures for 
ating on-the-job effectiveness to establi 
criterion of success, the classification syst 
necessarily directed at presenting the on 
job behaviors under headings that rep 
either well-marked phases of the job or 
vide a simple framework for classifying on-tl 
job activities that is either familiar to or easil; 
learned by supervisors. 
Similarly, in nonvocational activities the 
frame of reference depends on the uses d 
for the findings. For example, if a study is be- — 
ing made to define immaturity reactions in 
military personnel, the frame of 
would depend somewhat on whether the func- 
tional description is to be used primarily to 
identify personnel showing this type of malad- 
justment or whether the principal use will be — 
to try to prepare specifications for types of 
situations in which immaturity reactions 
would not lead to serious difficulties. —  .— 
b. Category formulation. The induction of 
categories from the basic data in the form of 
incidents is a task requiring insight, experi- — 
ence, and judgment. Unfortunately, this pro- 
cedure is, in the present stage of ps ologi 
knowledge, more subjective than objective. 
No simple rules are available, and the quality 
and usability of the final product are la ^ 
dependent on the skill and sophistication 
the formulator. One rule is to submit the ten- 
tative categories to others for review. 
though there is no guarantee that 
agreed on by several workers will be more use- 
ful than those obtained from a single worker, - 
the confirmation of judgments by a number of 
persons is usually reassuring. The usual pro- 
cedure is to sort a relatively small sample of 
incidents into piles that are related to 5 
frame of reference selected. After these tenta- 
tive categories have been established, 3 
definitions of them are made, and add nui 
incidents are classified into them. During bun 
process, needs for redefinition and for the de- 
velopment of new categories are noted. The 
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tentative categories are modified as indicated 
and the process continued until all the inci- 
dents have been classified. 

The larger categories are subdivided into 
smaller groups and the incidents that describe 
very nearly the same type of behavior are 
placed together. The definitions for all the 
categories and major headings should then be 
reexamined in terms of the actual incidents 
classified under each. 

c. General behaviors. The last step is to de- 
termine the most appropriate level of specific- 
ity-generality to use in reporting the data. 
This is the problem of weighing the advan- 
tages of the specificity achieved in specific inci- 
dents against the simplicity of a relatively 
small number of headings. The level chosen 
might be only a dozen very general behaviors 
or it might be several hundred rather specific 
behaviors. Practical considerations in the im- 
mediate situation usually determine the opti- 
mal level of generality to be used. 

Several considerations should be kept in 
mind in establishing headings for major areas 
and in stating critical requirements at the se- 
EM level of generality. These are listed be- 


_ (i) The headings and requirements should 
indicate a clear-cut and logical organization. 
They should have a discernible and easily re- 
membered structure. 

(ii) The titles should convey meanings in 
themselves without the necessity of detailed 
definition, explanation, or differentiation. 

is does not mean that they should not be 
defined and explained. It does mean that these 
titles, without the detailed explanation, 
should still be meaningful to the reader. 

(iii) The list of statements should be ho- 
Mogeneous; i.e., the headings for either areas 
9r requirements should be parallel in content 
and structure. Headings for major areas 
should be neutral, not defining either unsatis- 
factory or outstanding behaviors. Critical re- 
quirements should ordinarily be stated in posi- 
tive terms. 

(iv) The headings of a given type should all 
be of the same general magnitude or level of 
importance. Known biases in the data causing 
9ne area or one requirement to have a dispro- 
Portionate number of incidents should not be 
Teflected in the headings. 

(v) "The headings used for classification and 
Teporting of the data should be such that find- 
ings in terms of them will be easily applied and 
maximally useful. 

(vi) The list of headings should be compre- 

sive and cover all incidents having signifi- 
cant frequencies, 
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5. Interpreting and Reporting 

It is never possible in practice to 
obtain an ideal solution for each of 
the practical problems involved in 
obtaining a functional description of 
an activity. Therefore, the statement 
of requirements as obtained needs 
interpretation if it is to be used 
properly. In many cases, the real 
errors are made not in the collection 
and analysis of the data but in the 
failure to interpret them properly. 
Each of the four preceding steps, (a) 
the determination of the general aim, 
(b) the specification of observers, 
groups to be observed, and observa- 
tions to be made, (c) the data collec- 
tion, and (d) the data analysis, must 
be studied to see what biases have 
been introduced by the procedures 
adopted. If there is a division of 
opinion as to the general aim and one 
of the competing aims is selected, 
this should be made very clear in the 
report. If the groups on whom the 
observations are made are not repre- 
sentative of the relevant groups in- 
volved, they must be described as 
precisely as possible. The aim of the 
study is usually not a functional de- 
scription of the activity as carried on 
by this sample but rather a state- 
ment relating to all groups of this 
type. In order to avoid faulty in- 
ferences and generalizations, the limi- 
tations imposed by the group must 
be brought into clear focus. Simi- 
larly, the nature of judgments made 
in collecting and analyzing the data - 
must be carefully reviewed. 

While the limitations need to be 
clearly reported, the value of the 
results should also be emphasized. 
Too often the research worker shirks 
his responsibility for rendering a 
judgment concerning the degree of 
credibility which should be attached 
to his findings. It is a difficult task, 
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but if the results are to be used, 
someone will have to make such a 
judgment, and the original investi- 
gator is best prepared to make the 
necessary evaluations either for the 
general case or for certain typical 
specific examples. 


USES or THE CRITICAL INCIDENT 
TECHNIQUE 


The variety of situations in which 
the collection of critical incidents will 
prove of value has only been par- 
tially explored. In the approximately 
eight years since the writer and his 
colleagues began a systematic for- 
mulation of principles and procedures 
to be followed in collecting this type 
of data, a fairly large number of ap- 
plications has been made. The appli- 
cations will be discussed under the 
following nine headings: (a) Meas- 
ures of typical performance (criteria); 
(b) measures of proficiency (standard 
samples); (c) training; (d) selection 
and classification; (e) job design and 
purification; (f) operating procedures; 
(g) equipment design; (k) motivation 
and leadership (attitudes); (7) coun- 
seling and psychotherapy. 

Space is not available here to de- 
scribe these various applications in 
detail. However, a brief description 
of the types of application that have 
been made, along with brief illustra- 
tive examples and references, will be 
presented, Some of the studies in- 
volve several of the types of applica- 
tions to be discussed. The presenta- 
tion is not intended to be complete, 
but rather to give the reader inter- 
ested in further study some orienta- 
tion and guidance. 

Measures of typical performance 
(criteria). The simplest and most 
natural application of a systematic- 
ally collected set of critical incidents 
is in terms of the preparation of a 

statement of critical requirements 
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and a check list or some similar type 
of procedure for evaluating the typi- 
cal performance of persons engaged 
in this activity. If an observational 
check list that includes all of the im- 
portant behaviors for the activity is 
available, the performance of the in- 
dividual can be objectively evaluated 
and recorded by merely making a 
single tally mark for each observa- 
tion. Such records provide the essen- 
tial basis for criterion data which are 
sufficiently detailed and specific for 
special purposes but at the same time 
can be combined into a single over-all 
evaluation when this is desirable; 
Such a procedure was first suggested © 
and tried out in connection with de- 
velopmental studies of the American 
Institute for Research. These in- 
cluded: Preston's study of officers for 
the United States Air Force (52); 
Nagay's study on air route traffic 
controllers for the Civil Aeronautics 
Administration (49); and M. H. 
Weislogel's study on research per- 
sonnel for the Office of Naval Re- 
search (69). Another American In- 
stitute for Research study was re- | 
ported by R. B. Miller and the 
present author (21). This was a per- 
formance record form for hourly 
wage employees developed in coopera- 
tion with personnel of the Delco- - 
Remy Division of the General Motors 
Corporation, the Employment Prac- 
tices Division of that corporation, 
and the Industrial Relations Center 
of the University of Chicago. The 
same authors have developed similar 
performance records for salaried em- 
ployees, and foremen and supervisors 
(22, 23). The principles and pro- 
cedures underlying this type of evalu- 
ation of performance have been pu 
lished elsewhere (14, 15,17). 

A number of important contribu- 
tions to the development of func- 
tional descriptions and standards of 
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performance have been made by 
other groups using the critical inci- 
dent technique. One of the most 
notable of these is the development 
by Hobbs et al. (3, 31), of Ethical 
Standards of Psychologists. More 
than 1,000 critical incidents involving 
ethical problems of psychologists 
were contributed by the members of 
the American Psychological Associa- 
tion. It is believed that this repre- 
sents the first attempt to use empiri- 
cal methods to establish ethical 
standards. Because of the impor- 
tance of this study, and the generality 
of some of the problems involved, 
certain of the conclusions reported by 
the Committee on Ethical Standards 
for Pyschology in their introductory 
statement will be quoted here. 


First, it is clear that psychologists believe 
that ethics are important; over two thousand 
Psychologists were sufficiently concerned with 
the ethical obligations of their profession to 
contribute substantially to the formulation of 
these ethical standards. Second, psychologists 
believe that the ethics of a profession cannot be 
Prescribed by a committee; ethical standards 
must emerge from the day-by-day value com- 
mitments made by psychologists in the prac- 
tice of their profession. Third, psychologists 
Share a conviction that the problems of men, 
even those involving values, can be studied 
objectively; this document summarizes the re- 
sults of an effort to apply some of the tech- 
miques of social science to the study of ethical 

ehavior of psychologists. Fourth, psycholo- 
Sists are aware that a good code of ethics must 
k more than a description of the current sta- 
us of ethics in the profession; a code must em- 
py the ethical aspirations of psychologists 
encourage changes in behavior, bringing 
performance ever closer to aspiration. Fifth, 
peychologiste appreciate that process is often 
m a cortent than product in influencing 
iM: ul ehavior; the four years of widely- 
E porkin developing thiscodè are counted 
val more influential in changing ethical 
4 Sal! of psychologists than will be the pub- 
e of this product of their work. Finally, 
A Ologists recognize that the process of 
the ed ethical standards must be a continu- 
Sean occasional publications such as this 
en ent mark no point of conclusion in the 
ing process of defining ethical standards 
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—they are a means of sharing the more essen- 
tial discipline of examining professional expe- 
rience, forming hypotheses about professional 
conduct, and testing these hypotheses by ref- 
erence to the welfare of the people affected by 
them (3, p. v). 


In addition to the study by Smit 
mentioned in a previous section (58), 
several other studies on the use of the 
critical incident procedures as a 
basis for evaluating teaching effec- 
tiveness have been reported. One of 
these was a study conducted under 
the joint sponsorship of the Educa- 
tional Research Corporation and the 
Harvard University Graduate School 
of Education with funds provided by 
the New England School Develop- 
ment Council and the George F. 
Milton Fund. This was an explora- 
tory study of teacher competence 
reported by Domas (6). Approxi- 
mately 1,000 critical incidents were 
collected from teachers, principals, 
and other supervisors. Although this 
was an exploratory study, it was felt 
that it made an important contribu- 
tion to the general problem of relat- 
ing salary to teacher competence. 

The second of these studies was 
conducted as part of the teacher 
characteristics study sponsored by 
the American Council on Education 
and subsidized by the Grant Founda- 
tion. This study is reported by 
Jensen (32). Teachers, administra- 
tors, and teachers in training in the 
Los Angeles area contributed more 
than 1,500 critical incidents of teacher 
behavior. The incidents were classi- 
fied under personal, professional, and 
social qualities. The category formu- 
lation indicated that there were about 
20 distinct critical requirements, 
These were recommended as a basis 
for teacher evaluation and as an aid 
to the in-service growth of teachers. 

Another study was that of Smith 
and Staudohar (59), which deter- 
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mined the critical requirements for 
basic training of tactical instructors 
in the United States Air Force. From 
130 training supervisors, 555 tactical 
instructors, and 3,082 basic trainees, 
a total of 6,615 usable incidents were 
obtained. The authors comment 
that: 

The training supervisors report a predomi- 
nance of ineffective incidents in the major 
areas of: Sets a good example and maintains 
effective personal relations. The tactical in- 
structors report more ineffective incidents in 
the area of Makes his expectations clear. Ba- 
sic trainees show a predominance of ineffec- 
tive incidents in three areas: Sets a good ex- 
ample, Considers trainee's needs, and Main- 
tains effective personal relations (59, p. 5). 

Another study on the evaluation of 
instructor effectiveness was carried 
out by Konigsburg (33). This study 
involved the development of an in- 
structor check list for college in- 
structors based on the critical in- 
cident technique and a comparison 
of techniques for recording observa- 
tions. Its principal findings were the 
very low correlation coefficients be- 
tween the total scores from the 
Purdue Rating Scale for Instruction 
and the instructor check list. When 
these two instruments were each 
given to half the class on the same 
day, the average correlation coeffi- 
cient was found to be .29. The other 
principal finding is that the planned 
performances of a total of 46 pre- 
determined behaviors were better 
reflected by the results obtained on 
the instructor check list than by the 
results on the Purdue Rating Scale. 

A somewhat related study has been 
reported by Barnhart (4). This 
study collected a large number of 
critical incidents for the purpose of 
establishing the critical requirements 
for school board membership. The 
author applied his findings to the 
problem of evaluating the effective- 
ness of school board members. 
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Another type of application of the 
critical incident technique to the de- 
velopment of bases for evaluating be- 
havior is the previously mentioned 
study of Eilbert (7). His list of 51 
types of immature reaction based on 
a collection of several hundred criti- 
cal incidents describing manifesta- 
tions of emotional immaturity is be- 
lieved to provide a useful guide to 
further investigation and appraisal of 
persons with behavior problems. It 
is believed that the results of this 
study provide substantial encourage- 
ment to the application of the critical 
incident technique to similar prob- 
lems in the field of clinical diagnosis 
and evaluation. 

Measures of proficiency (standard 
samples). A closely related use of 
critical incidents is to provide a basis 
for evaluating the performance of 
persons by use of standard samples of 
behavior involving important as- 
pects of the activity. Such evalua- 
tions are called proficiency measures 
and are differentiated from the evalu- 
ation of typical performance on the 
job primarily on the basis that a test 
situation rather than a real job situa- 
tion is used. Measures of this sort are 
especially useful at the end of train- 
ing courses as checks on the main- 
tenance of proficiency, and when the 
tasks assigned to participants vary à 
great deal in difficulty or are not di- 
rectly observed by the supervisors. 

One of the first applications © 
critical incidents to the development 
of proficiency measures was Gordon $ 
study on the development of a stand- 
ard flight check for the airline trans- 
port rating (28, 29). This study was 
done by the American Institute for 
Research under the sponsorship 9 
the National Research Council Com- 
mittee on Aviation Psychology with 
funds provided by the Civil Aero- 
nautics Administration. In this 
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study data from analyses of airline 
accidents were combined with critical 
incidents reported by airline pilots to 
provide the basis for developing an 
objective measure of pilot profi- 
ciency. The flight check consisted of 
the presentation of situations pro- 
viding uniformly standardized op- 
portunities to perform the critical 
aspects of the airline pilot's job as 
indicated from the study of the acci- 
dents and critical incidents reported. 
The new check was found to yield 88 
per cent agreement on the decision to 
pass or fail a particular pilot when 
examined on flights on successive 
days by different check pilots. The 
previous flight check when used on 
the same flights gave only 63 per cent 
agreement, which was little better 
than chance under the conditions of 
the study. 

Similar studies on the development 
of flight checks at the American In- 
stitute for Research have been carried 
out by Marley (36, 37), G. S. Miller 
(39), and Ericksen (9). These studies, 
sponsored by the United States Air 
Force and the Civil Aeronautics Ad- 
ministration, were concerned re- 
spectively with objective flight 
checks for B-29 bombing crew mem- 
bers, B-36 bombing crew members, 
and private pilots flying light civilian 
aircraft. Ericksen also developed a 
light plane proficiency check to pre- 
dict military flying success (10) on 
à similar project sponsored by the 
United States Air Force Human Re- 
Sources Research Center. 

A similar set of proficiency meas- 
ures was developed by Krumm for 
Air Force pilot instructors (34, 35), 
also under the sponsorship of the 
en Resources Research Center. 
Ps ese measures were based on more 
fr oo 4,000 critical incidents collected 
ny student pilots, flight instructors, 

supervisors. The critical inci- 
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dents were classified under three 
main headings: (a) proficiency as a 
pilot; (b) proficiency as a teacher; 
and (c) proficiency in maintaining 
effective personnel relations. The 
proficiency measures developed in 
connection with this study included 
paper-and-pencil tests presenting cri- 
tical situations and requiring the 
instructor to select one of several 
proposed solutions. 

Another development of this type 
carried on at the American Institute 
for Research was the construction of 
tests for evaluating research pro- 
ficiency in physics and chemistry for 
the Office of Naval Research by 
M. H. Weislogel (71). This study 
was based on the critical incidents for 
research personnel (20) discussed in 
a previous section. The items for 
these proficiency measures were 
based on detailed rationales. The 
items described a practical research 
situation in considerable detail and 
outlined five specific choices concern- 
ing such matters as the best thing to 
do next, suggestions for improving 
the procedure as reported, etc. The 
critical behaviors tested in the items 
were taken directly from the critical 
incidents. The method of developing 
tests through the use of comprehen- 
sive rationales has been discussed 
generally in another paper (16). 

Three studies have been reported 
by the American Institute for Re- 
search in which critical incidents 
were used as a basis for developing 
situational performance tests for 
measuring certain aspects of the pro- 
ficiency of military personnel. These 
included the study of Sivy and Lange 
on the development of an objective 
form of the Leaders Reaction Test 
for the Personnel Research Branch, 
Department of the Army (57). This 
test included 20 situational problems 
based on the critical requirements of 
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the noncommissioned combat infan- 
try leader as determined on the basis 
of critical incidents collected in mili- 
tary maneuvers and during combat 
operations at the front in Korea. A 
second proficiency measure of a 
somewhat similar sort was developed 
for other types of personnel by R. L. 
Weislogel (73). The third study of 
this type was carried out by Suttell 
(61) for the Human Resources Re- 
search Center. This study was based 
on critical incidents collected in pre- 
vious studies of the American Insti- 
tute for Research and reported the 
development and preliminary evalua- 
tion of the Officer Situations Test. 
This test was designed to measure 
nonintellectual aspects of officer per- 
formance through the use of 16 
situational problems requiring about 
six hours of testing time. 

Because of the great difficulty in 
obtaining valid and reliable measures 
of typical performance, accurate 
measures of proficiency are essential 
for many types of activities. It is 
apparent that a comprehensive set of 
critical incidents can be of great 
value in constructing such measures. 

Training. Many of the applica- 
tions of the critical incident technique 
to training problems have been car- 
ried out for the military in special 
situations so that the reports are 
classified security information. In 
addition to work by Preston, Glaser, 
and R. L. Weislogel, R. B. Miller and 
Folley have utilized critical inci- 
dents in establishing training require- 
ments for specific types of main- 
tenance mechanics (47) in a study for 
the Human Resources Research Cen- 
ter. 

Similarly, Ronan has used critical 
incidents as a basis for developing a 
program of training for emergency 
procedures in multi-engine aircraft 
(54) in a study for the United States 
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Air Force Human Factors Operations 
Research Laboratory. On the basis 
of several thousand incidents re- 
ported by aircrew personnel regard- 
ing emergencies, three evaluation 
devices were prepared. These in- 
volved a conventional type multiple 
choice test; a special multiple-choice 
test designed to measure the indi- 
vidual's information concerning the 
important cues in the emergency 
situation, the appropriate actions to 
be taken, and the basic troubles or 
causes of the emergency; and à 
“flight check” to be used in evaluat 
ing the performance of aircrew mem- 
bers in electronic flight simulators. 

The obvious rele ince of the be- 
haviors involved in critical incidents 
and the specific details included make 
such incidents an ideal basis for de- 
veloping training programs and train- 
ing materials. 

A recent study by Collins (5) uses 
critical incidents as a basis for evalu- 
ating the effectiveness of a training 
program. The types of incidents re- 
ported by mothers after a two-week 
training course were significantly dif- 
ferent from those reported at the 
beginning of the program in a number 
of aspects relevant to the objectives 
of the program. The critical inci- 
dents appeared to provide a much F 
more sensitive basis for revealing 
changes than other procedures used. 

Selection and classification. Until 
recently, the customary approach ofi 
the research psychologist to the de- 
velopment of tests for selection and 
classification purposes has been 8$ 
follows: A very brief period was given 
to study of the job. Following this, 
a wide variety of selection procedures | 
was administered to a group of ap- 
plicants or employees, and follow-up — 
data were gathered. Since the rê- 
search psychologist had little con- 
fidence in the accuracy of his analyst 
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of the psychological elements re- 
quired by the job, there was à tend- 
ency to try everything that was 
available and seemed even remotely 
related to the tasks involved. This 
has been called the "shotgun ap- 
proach.” It was hoped that with a 
wide scatter at least a few of the tests 
would pay off. The critical incident 
technique has lent substantial sup- 
port to the more thorough study of 
the job prior to initiating testing pro- 
cedures. There is increasing feeling 
at the present time that a much 
larger percentage of the investiga- 
tor's time should be spent on deter- 
mining the critical requirements of 
the job, so that the psychologist will 
have sufficient confidence in his 
tentative conclusions as to the nature 
of the important selection procedures 
to permit their use on a tentative 
basis prior to the collection of em- 
pirical follow-up data. This is espe- 
cially important in those situations 
where the follow-up requires a very 
long period of time or where the 
number of cases that can be followed 
up is so small that definitive findings 
cannot be anticipated. 

One of the most important re- 
quirements for developing a system 
of job analysis that will facilitate a 
relatively accurate identification of 
the important job elements for a 
Specific task is to establish a clear and 
Specific set of definitions for these job 
elements in behavioral terms. The 
American Institute for Research has 
carried out a series of projects on this 
Problem. The first of these was a 
Study undertaken by Wagner under 
end Sponsorship of the United States 

ir Force School of Aviation Medi- 
cine to define the requirements of air- 
ah jobs in terms of specific job 
elements (67, 68). Several thousand 
al incidents were gathered from 
ircrew members, and these were 
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classified into 24 job elements. These 
job elements were inductively formu- 
lated from the critical incidents and 
were grouped under the four area 
headings: (a) learning and thinking; 
(b) observation and visualization; 
(c) sensory-motor coordination ; and 
(d) motives, temperament, and lead- 
ership. 

The development of more than 100 
proficiency tests to measure each of 
the various critical behaviors in- 
cluded in the 24 tentatively proposed 
job elements was reported by Hahn 
(30) for the School of Aviation Medi- 
cine. These tests were administered 
to a group of approximately 500 high 
school senior boys, and the intercor- 
relations were used to reformulate 
the tentative job elements. In a 
study just completed by Taylor (62) 
for the Human Resources Research 
Center, the results of applying an 
analytical procedure developed by 
Horst to study the interrelationships 
involved are reported. This analysis 
led to the formulation of a new set of 
20 job elements for each of which a 
selection. test has been developed. 
These tests have been administered 
to several hundred aviation cadets 
and follow-up data on their success in 
aircrew training should be available 
soon. 

A similar project based on critical 
incidents collected from various ci- 
vilian jobs has been reported by the 
present author (2, 18, 19). The 
Flanagan Aptitude Classification 
Test Series, published in 1953, pro- 
vides aptitude measures for 14 criti- 
cal job elements. The Applicant 
Inventory, also published in 1953, 
measures attitudes predictive of job 
adjustment for hourly wage. em- 
ployees. 

‘An effort to adapt the critical in- 
cident technique to the problem of 
developing civil service examinations 
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is reported by Wager and Sharon 
(64). In an exploratory study they 
collected about 100 incidents regard- 
ing on-the-job behaviors of main- 
tenance technicians. These incidents 
were used as a basis for determining 
job requirements in terms of be- 
havior, and test items were developed 
for use in selecting applicants who 
could be expected to meet these re- 
quirements. 

Another study that used critical 
incidents as a basis for developing 
tests to predict performance was car- 
ried out by O'Donnell (51). His test, 
designed to predict success in den- 
tistry, was based on critical incidents 
collected by Wagner. The test in- 
cludes items designed to predict, in 
part, the following three general 
areas: (a) demonstrating technical 
proficiency; (b) handling patient re- 
lationships; and (c) accepting pro- 
fessional responsibility. A follow-up 
study indicated moderate validity for 
these materials. 

One of the few studies known to 
the author in which the critical inci- 
dent technique was used in a project 
carried on outside the United States 
is Emons' doctor's dissertation (8). 
'This study, carried out at the Uni- 
versity of Liége, investigated the ap- 
titudes of effective sales personnel in 
a large department store. A group of 
40 supervisors provided 228 critical 
incidents. Nine categories were for- 
mulated from this group of incidents 
and recommendations made for an 
aptitude test to improve current 
selection procedures. 

5. Job design and purification. In- 
adequate attention has been given to 
the scientific design of jobs to pro- 
mote over-all efficiency. Where a 
team has several different types of 
tasks to perform, it is frequently pos- 
sible to design each of the team mem- 
ber’s jobs so that only a few of the 
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several tasks are involved. If the 
jobs have been studied by use of the 
critical incident technique, it may be 
possible to select and train each team 
member for only two or three of the 
critical job elements. This tends to 
maximize the effectiveness of per- 
formance with respect to each of the 
various types of tasks. Although 
such procedures have nearly always 
been informally used in planning the 
work of teams, the critical incident 
technique facilitates the collection of 
the data essential to this type of job 
purification. 

Some preliminary work on this 
problem has been carried out at the 
American Institute for Research. 
Recommendations resulting from 
these studies for reducing the number 
of job elements required in certain 
common maintenance jobs are ex- 
pected to lead to a saving of millions 
of dollars in training costs as well as 
to improving the effectiveness of job 
performance. 

Operating procedures. Another ap- 
plication of critical incidents which 
has not been adequately exploited is 
the study of operating procedures. 
Detailed factual data on successes 
and failures that can be systematic- 
ally analyzed are of great importance 
in improving the effectiveness and 
efficiency of operations. Such infor- 
mation can be efficiently collected by 
means of the critical incident tech- 
nique. 

Examples of such studies are pro- 
vided by a series of three projects 
carried out by the American Institute 
for Research under the sponsorship 
of the United States Air Force 
School of Aviation Medicine. — The 
first of these involves the collection 9 
critical incidents relating to near at- 
cidents in flying reported by Vasilas, 
Fitzpatrick, DuBois, and Youtz (63). 
More than 1,700 critical incidents 
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were collected from pilots and other 
aircrew members by procedures de- 
veloped for this study. These inci- 
dents pointed to possible improve- 
ments in training job design and 
equipment design as well as in operat- 
ing procedures. 

The second of these studies was 
specifically concerned with the effect 
of the age of pilots and other crew 
members on aircrew operations. This 
study was reported by Shriver (56), 
and included tentative suggestions 
regarding various modifications in 
operating procedures. 

The third study in this series, re- 
ported by Goodenough and Suttell 
(26), involved the collection of criti- 
cal incidents regarding the impair- 
ment of human efficiency in emer- 
gency operations. These incidents 
provide a detailed statement of both 
the types of stresses that impair per- 
formance and the types of perform- 
ance that are impaired under specific 
conditions. More than 2,000 critical 
incidents were collected in which 
impairment in performance on opera- 
tional assignments was observed. 
These incidents were collected in 
Alaska and the Far East as well as in 
operational commands in the United 
States. "This report contains sugges- 
tions for improving operations in 
emergency situations. 

Equipment design. An application 
closely related to that just discussed 
involves the collection of critical in- 
Cidents to improve the design of 
equipment. Reports of specific in- 
Cidents from the field have always 
“han a basis for equipment modifica- 
E The critical incident technique 
a 1 Tn the collection and process- 
E of this type of information. Too 
: en in the past action was taken on 
E € basis of informal reports from 
Een personnel. The collection 

arge numbers of critical incidents 
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representative of operating experi- 
ence provides a sound basis for modi- 
fying existing equipment and design- 
ing new models. 

In the study by Fitts and Jones 
(12), mentioned above, which was 
carried out at the Aero-Medical 
Laboratory, 270 critical incidents 
relating to errors in reading and in- 
terpreting aircraft instruments were 
collected and analyzed. These led to 
a number of specific suggestions re- 
garding modifications in instrument 
displays. 

Other recent studies conducted at 
the American Institute for Research 
have used data from the critical in- 
cident technique along with other 
sources to develop procedures for 
designing jobs. The reports on these 
projects are classified for military 
security reasons. 

Other projects at the American 
Institute for Research have used the 
critical incident technique as a sup- 
plemental procedure for task analysis 
of equipment in the design stage of 
development (9, 10, 34, 35, 39). 
These procedures have been found 
very effective when used by psy- 
chologists working closely with en- 
gineers on the preparation of design 
specifications for new equipment. 

Motivation and leadership. The 
study of attitudes has been somewhat 
limited and difficult to interpret be- 
cause of the almost exclusive reliance 
on verbal statements of opinions and 
preferences. The critical incident 
technique has been applied in a few 
instances to gather factual data re- 
garding specific actions involving de- 
cisions and choices. These studies 
suggest that critical incidents of this 
type may be a very valuable supple- 
mentary tool for the study of atti- 
tudes. 

A recent study carried out by 
Preston of the American Institute for 
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Research for the Air Force's Human 
Resources Research Center (53) used 
critical incidents as a basis for study- 
ing decisions of airmen to re-enlist in 
the Air Force. It is believed that 
these specific incidents provide valu- 
able information not contained in 
studies utilizing only data on opin- 
ions. 

A series of reports by Ruch (55) 
contains critical incidents on combat 
leadership collected from senior offi- 
cers in the Far East Air Forces. 
These incidents provide a factual 
basis for the study of motivation and 
leadership of Air Force personnel 
engaged in combat operations. 

Counseling and psychotherapy. An- 
other field in which current tech- 
niques emphasize over-all impres- 
sions, opinions, and reports of single 
cases is counseling and psychother- 
apy. There appears to be a trend, 
however, in this field toward em- 
phasizing the collection of factual 
incidents. This suggests that the 
critical incident technique may be 
useful in this area also. 

Exploratory work has recently 
been done at the University of Pitts- 
burgh with the critical incident tech- 
nique to establish areas of change 
accompanying psychotherapy. A 
series of three master's theses were 
carried out by Speth, Goldfarb, and 
Mellett (25, 38, 60). They collected 
243 critical incidents from 11 psycho- 
therapists. These incidents were col- 
lected about patients who had shown 
improvement and were replies to the 
question, "What did the patient do 
that was indicative of improve- 
ment?" Although these studies were 
primarily exploratory in nature, the 
tentative finding that different thera- 
pists stress different criteria of im- 
provement and nonimprovement sug- 
gests that the critical incident ap- 
proach may be of use not only in de- 
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veloping objective measures of im- 
provement but also in experimental 
studies of the types of improvement 
resulting from the therapists' use of 
specific procedures. 

A somewhat related type of study 
initiated by Diederich and reported 
by Allen (1) describes the use of the 
technique to obtain critical incidents 
from students reporting things that 
caused them to like a fellow high 
school student either more or less 
than before. This study is being con- 
tinued to provide the basis for tests 
of specific value areas. An incidental 
finding of the study was that when 
an example of the kind of incident 
desired was shown on the form, 53 
per cent of the positive and 23 per 
cent of the negative behaviors re- 
ported were in the same category as 
the example given. 


SUMMARY AND CONCLUSIONS 


This review has described the de- 
velopment of a method of studying 
activity requirements called the criti- 
cal incident technique. The tech- 
nique grew out of studies carried out 
in the Aviation Psychology Program 
of the Army Air Forces in World 
War II. The success of the method 
in analyzing such activities as com- 
bat leadership and disorientation m 
pilots resulted in its extension an 
further development after the war. 
This developmental work has been 
carried out primarily at the American 
Institute for Research and the Uni- 
versity of Pittsburgh. The reports of 
this work are reviewed briefly. — . 

The five steps included in the crit- 
ical incident procedure as most 
commonly used at the present time 
are discussed. These are as follows: 
(a) Determination of the general aim 
of the activity. This general aim 
should be a brief statement obtained 
from the authorities in the fie 
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which expresses in simple terms those 
objectives to which most people 
would agree. (b) Development of 
plans and specifications for collecting 
factual incidents regarding the ac- 
tivity. The instructions to the per- 
sons who are to report their observa- 
tions need to be as specific as possible 
with respect to the standards to be 
used in evaluating and classifying the 
behavior observed. (c) Collection of 
the data. The incident may be re- 
ported in an interview or written up 
by the observer himself. In either 
case it is essential that the reporting 
be objective and include all relevant 
details. (d) Analysis of the data. 
The purpose of this analysis is to 
summarize and describe the data in 
an efficient manner so that it can be 
effectively used for various practical 
purposes. It is not usually possible to 
obtain as much objectivity in this 
step as in the preceding one. (e) In- 
terpretation and reporting of the 
Statement of the requirements of the 
activity. The possible biases and im- 
plications of decisions and procedures 
made in each of the four previous 
steps should be clearly reported. The 
research worker is responsible for 
pointing out not only the limitations 
but also the degree of credibility and 
the value of the final results obtained. 
, It should be noted that the critical 
Incident technique is very flexible and 
the principles underlying it have 
od types of applications. Its two 
2 E principles may be summarized 
ollows: (a) reporting of facts re- 
Eco behavior is preferable to the 
E of interpretations, ratings, 
E non based on general im- 
ia (b) reporting. should. be 
Eod to those behaviors which, 
ing to competent observers, 


Ee a significant contribution to 
€ activity. 
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It should be emphasized that criti- 
cal incidents represent only raw data 
and do not automatically provide 
solutions to problems. However, a 
procedure which assists in collecting 
representative samples of data that 
are directly relevant to important 
problems such as establishing stand- 
ards, determining requirements, or 
evaluating results should have wide 
applicability. 

The applications of the critical 
incident technique which have been 
made to date are discussed under the 
following nine headings: (a) Meas- 
ures of typical performance (criteria); 
(b) measures of proficiency (standard 
samples); (c) training; (d) selection 
and classification; (e) job design and 
purification; (f) operating proced- 
ures; (g) equipment design; (Ai) moti- 
vation and leadership (attitudes); 
(i) counseling and psychotherapy. 

In summary, the critical incident 
technique, rather than collecting 
opinions, hunches, and estimates, 
obtains a record of specific behaviors 
from those in the best position to 
make the necessary observations and 
evaluations. The collection and tabu- 
lation of these observations make it 
possible to formulate the critical 
requirements of an activity. A list of 
critical behaviors provides a sound 
basis for making inferences as to re- 
quirements in terms of aptitudes, 
training, and other characteristics. 
It is believed that progress has been 
made in the development of pro- 
cedures for determining activity re- 
quirements with objectivity and pre- 
cision in terms of well-defined and 
general psychological categories. 
Much remains to be done. It is 
hoped that the critical incident tech- 
nique and related developments will 
provide a stable foundation for pro- 
cedures in many areas of psychology. 
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It is customary to distinguish two 
main types of validity; these may be 
named higher-order (internal) and 
lower-order (external) validity. Inter- 
nal validation consists essentially in 
measuring the correlation between 
different tests supposed to measure 
the same variable. Consideration of 

this method of validation will be re- 
served until later, since the tests to be 

evaluated below have not generally 
been validated in this way, and the 
use of this method is comparatively 
rare. Lower-order validity implies 
the use of an external criterion 
against which the test is validated. 

In the development of a test of brain 

damage, for instance, the external cri- 

terion would consist of a number of 
clinical groups, such as a brain-dam- 
aged group of patients, a group of 

Psychiatric (functional) patients 

Without brain damage, and a group of 

Normal controls; a valid test of brain 

damage would then be expected to 

distinguish patients in the brain- 
ing group from those in the 

n er two groups. The difficulty in 

"jd this method of validation lies, 

te course, in the fact that the cri- 

Er itself is in need of validation. 

a as been shown by Ash (5) and 
ers that the reliability of psychi- 


1 . 
oe writer expresses his appreciation to 
a TS of the psychological department of 
fete Ospital for helpful comments on this 
Qus » Which was read at a meeting of the 
BE of Professional Psychologists at 
audsley Hospital on May 9, 1953. The 
E Teported in this paper was made possible 
Ed gant Írom the Research Fund, made 
- ot e ble from the endowment by the Board 
nors of the Bethlem Royal Hospital 


and the Maudsley Hospital. 


atric diagnosis is so low that it is dif- 
ficult to rely on such classifications in 
the development of tests, Even when 
very broad groups are used, the de- 
gree of error may be considerable. It 
seems necessary to use the classifica- 
tion system given above, however, 
because this is the procedure adopted 
by the authors of most of the tests to 
be discussed, because the groups may 
be more reliably discriminated than 
in most cases, and because, in the 
case of the brain-damaged group at 
least, independent confirmation may 
be forthcoming in the shape of neuro- 
logical signs, post-mortem examina- 
tion, etc. 


It is assumed, then, that it is possible to 
isolate such broad groups for the purposes of 
research; however, there remain other condi- 
tions which must be fulfilled before the valid- 
ity of a given test can be discussed. In general, 
these may be summarized as follows: 

a. The test should present data for ade- 
quate samples of the above-mentioned three 
groups (brain-damaged, functional, and nor- 


mal). 

4 The data should be presented in such a 
form as to enable the clinician using the test 
to estimate the degree of possible error when 
assigning a patient to any one of the groups. 
Such data may be stated in three different 
ways. First, the optimum cutoff point may be 
given; this is the point at which it is possible 
to identify as many brain-damaged patients 
as possible and at the same time misclassify 
as few functionals and normals as possible. 
(If the distributions are normal, this can be 
stated simply in terms of the mean and stand- 
ard deviation.) Second, the point beyond 
which no normals or functionals of the sam- 
ples fall should be given. Third, the point be- 

which no brain-damaged patient of the 
sample falls should also be given. 

c. The reliability of the results should be 
verified by applying the test to new groups 
that are independent of the original criterion 
groups. Alternatively (or, preferably, in addi- - 
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tion), the findings should be confirmed by 
another worker in another hospital. 

d. The influence of various factors, such as 
age, sex, and intelligence, and any special 
factors (such as visual acuity when perceptual 
tests are used, and motor coordination when 
mechanical tests are used) should be con- 
trolled. 


It will be the purpose of this paper 
to show that most of the tests pur- 
porting to be measures of brain dam- 
age do not meet the conditions set out 
above and that, therefore, their valid- 
ity either cannot be considered as es- 
tablished or cannot even be evaluated. 
It may be pointed out, however, that, 
even if these conditions were ade- 
quately met, no general judgment 
can be made about the validity of a 
particular test. Whether a given test 
is considered to be a valid measure of 
brain damage, or whether it is not, 
will depend on many factors. The 
most important of these will be 
whether or not the test works in prac- 
tice. Because a test is usually stand- 
ardized on relatively pure groups, the 
discriminating power must neces- 
sarily drop when the test is used clini- 
cally. Again, precise confidence limits 
are difficult to apply. Thus, a test 
that identifies only 20 per cent of 
brain-damaged patients admitted to 
a hospital may be a very useful clini- 
cal instrument if these patients are 
not identifiable at the time of testing 
by any other means. On the other 
hand, a test that identifies 60 per cent 
of all brain-damaged patients may not 
be very useful clinically because most 
of the patients it identifies are obvious 
cases of brain damage and can be de- 
tected by simpler means. In some in- 
stances, it may be possible to use a 
cutoff point in such a way that a pa- 
tient falling above it is unequivocally 
identified as brain-damaged, while, if 
he falls below it, he (being brain-dam- 
aged) is not misclassified ; but the ques- 
tion remains an open one, no state- 
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ment being made about the patient 
However, when considering the 
standardization of a test, one may 
reasonably demand a low percentage 
of  misclassification, because the 
groups are carefully chosen for their 
clinical differences. The various tests 
of brain damage will now be consid- 
ered to see how far they fulfill the cris 
teria already laid down. 

The tests will be considered under 
two broad headings—those testi 
which employ qualitative methods. 
and those which employ quantitative: 
methods. Those tests using quanti 
tative methods will, in turn, 
grouped into those utilizing the com 
cept of deterioration and those meas 
uring perceptual or motor functions. 
Such a division is, of course, quite: 
arbitrary, and is dictated by conven- 
ience. 


QUALITATIVE TESTS OF BRAIN 
DAMAGE 


The criteria laid down above pre- 
suppose adequate statistical treat- 
ment of data. However, such treat- 
ment is almost entirely lacking in one 
of the most widely used and reputable. 
batteries of tests of brain damage— 
the tests of abstract concept for na- 
tion developed by Goldstein ang 
Scheerer (22) and their colleagues 
These tests are so well known thatit 
is unnecessary to describe them. tagi 
basic criticism to be made of these 
tests is not that they are invalid, b 
that there is no basis for a discussion 
of their validity. The tests are unique 
in diagnostic psychological testing i 
providing no quantitative data on U^ 
subjects used and providing no per- 
centage of incorrect diagnoses, in E 
noring the effects of age and inteli- 
gence level completely, and in assu 
ing that the performance of norma 
people of average intelligence will n 
without error. Under these circ 
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stances, it is clear that the validity of 
the tests is something private to each 
individual user. It is possible that, in 
the hands of a skilled clinician, the 
tests have a high validity; but, even in 
the case of Goldstein himself, no in- 
formation is available about the num- 
ber of times he has been in error. Two 
further points may be made: First, 
there has never been any clear agree- 
ment concerning what the tests meas- 
ure. Goldstein believes that they 
measure the ability to abstract, and 
that brain injury leads to a defect in 
assuming the abstract attitude, which 
is revealed in test performance; on 
the other hand, Hutton (36) believes 
that failure on the Block Design test 
is due to overabstraction, not failure 
of abstraction. Second, it is signifi- 
cant that the Goldstein-Scheerer tests 
have been used to determine the pres- 
ence or absence of schizophrenic 
thought disorder, and that other 
tests, developed specifically to test 
for schizophrenia, bear a strong re- 
semblance to the Goldstein tests, e.g., 
the Hanfmann-Kasanin Test of Con- 
cept Formation. Little work has been 
done to differentiate between the per- 
formance of schizophrenics and brain- 
damaged patients on these tests; 
duo they are very difficult to apply 

inically. 
Eat the necessary quantification 
E E be possible may be easily in- 
i rec from examination of Gold- 
temn's manual, where many possible 
e nona of ‘measurement are indi- 
ni ed in qualitative form. A begin- 
ng has, in fact, been made. Boyd 
E urne 54 normal hospitalized 
for he bh up quantitative norms 
eg lock Design test by giving 
t ts to the various steps utilized 
[m oldstein. When he divided his 
ir ae to Wechsler IQ, he 
«ec s perfect performance would 
€ expected, provided the IQ 
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was 100 or more. As the IQ dropped, 
however, so did the score on the 
Block Design test, from a mean score 
of 120 (SD=0) for IQ 100-109 to a 
mean score of only 99.5 (SD 34.5) 
for IQ 66-79. He was able to give 
limits within which a person of a 
given IQ level would be expected to 
fall. He thus demonstrated that in- 
telligence is an important factor in 
performance on these tests; he also 
found that psychotics tended to do 
worse than brain-damaged patients 
(though his numbers were very 
small). 

Lidz, Gay, and Tietze (41), using 
the scoring system developed by 
Kohs, found that the Kohs test dis- 
criminated significantly between 21 
organics with deterioration, and 15 
nondeteriorated schizophrenics (with 
a misclassification of only 2/36). 
However, there was no control for 
age, the difference between the two 
groups being certainly significant. An- 
other attempt to quantify the Block 
Design test was reported by Armi- 
tage (4). Using three groups of sub- 
jects—normal, neurotic, and brain- 
damaged—he calculated the percent- 
age of controls and organics requiring 
assistance at each of four steps. He 
concluded that the quantitative re- 
sults were disappointing from the 
standpoint of a screening device; so, 
to determine whether the discrimina- 
tory ability of the test could be 
increased, he made a multiple ap- 
proach. This involved the computa- 
tion of such variables as the time 
taken to complete each design, the 
number of incorrect moves, the order 
or sequence of placing the blocks, and 
the number of correct first block 
placements. None of these methods 
proved successful. 

Tooth (67) gave a series of tests, 
including the Kohs Block test and the 
Weigl Color Form Sorting Test, to 
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100 cooperative naval officers and 
ratings admitted to a naval hospital 
with a history of an injury to the 
head and a diagnosis of postconcus- 
sional state. As controls, he used 50 
convalescent patients in the surgical 
wards of the same hospital and 50 
neurotics carefully chosen from 2,000 
cases seen in a naval psychiatric 
clinic. After careful statistical analy- 
sis of the results, he concluded that 
the method did not give sufficient 
quantitative discrimination between 
head injury cases and normals to be 
of much practical importance in the 
assessment of the individual case. 
Furthermore, in these two tests, a 
difference of as great a magnitude was 
found between the normal controls on 
the one hand and a series of neurotic 
patients in whom no organic condi- 
tion was known to exist on the other. 

With respect to object sorting, 
Halstead (27), using groups of cases 
with lesions in various areas of the 
brain, compared them with a normal 
group on an object-sorting test, 
larger in content and presented under 
conditions somewhat different from 
those obtaining in the Goldstein test. 
The sorting behavior was analyzed 
according to a number of criteria, in- 
cluding the percentage of objects 
grouped, the total percentage of ob- 
jects recalled after five minutes, and 
the percentage of objects grouped 
that were recalled after the same in- 
terval. With respect to these three 
variables, for 11 normals and 11 
cases of frontal lobe injury there was 
no overlap in the scores of the two 
groups on the first and third vari- 
ables, and only two brain-damaged 
cases exceeded the lowest normal 
score on the second variable. On the 
other hand, cases with lesions in other 
parts of the brain showed consider- 
able overlap with normals. Hal- 
stead’s results confirmed to some 
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extent Goldstein’s hypothesis that 
performance as a whole on this test 
tends to be lowered in cases of frontal 
lobe lesion. He also offered some 
evidence for the hypothesis that pa- 
tients with injury to the frontal lobes 
cannot sort according to an abstract 
principle. In terms of the number of 
abstract groupings produced, there 
was again no overlap between the 
normals and the cases of frontal lobe 
injury. 

A rather radical and important de- 
parture from the usual procedure in 
the Block Design test has been made 
by Grassi (23, 24) in his Block Sub- 
stitution Test for measuring organic 
brain pathology. Essentially, the 
main innovations are that the pa- 
tient copies not a drawing but a set 
of blocks, and an attempt is made to 
measure both concrete and abstract 
reproduction at two levels. The test 
consists of five designs or models, 
each of which is reproduced by the 
patient at four levels of increasing 
complexity. In the simple concrete 
task, the patient copies only the top 
side of the model. In the simple ab- 
stract task, he copies the top side 
again, but this time using different 
colors from those of the model. In — 
the complex concrete task, the pa 
tient has to copy the model correctly 
with respect to all six sides, while in | 
the complex abstract task, he copies 
the complete model again, but this 
time using different colors from those 
of the model. The scoring system !$ 
simple, accuracy and time being T€ 
warded; the maximum score poss! e 
is 30. The standardization groups afe 
unusually large for this kind of work, 
consisting of 86 normals, 86 E 
phrenics (equally divided into e 
teriorated and nondeteriorat 
cases), 72 organics, and 30 postlo- 
botomy cases. Grassi states that 10 
influence of age, sex, or other factors 
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was found, while test-retest relia- 
bility was high (+.85). Means and 
ranges are given for the four groups 
and show that (using a cutoff point 
of 16) there was no overlap between 
the two organic and the two nonor- 
ganic groups. If this finding can be 
confirmed, it is clearly of great impor- 
tance. There is some doubt, however, 
about the accuracy of Grassi's fig- 
ures. For example, although the 
ranges of the organic and nonorganic 
groups do not overlap, Grassi reports 
in another section that six schizo- 
phrenics scored below 16 and were 
not included in the standardization 
data; also, that eight organics scored 
"slightly above" 16 and were cor- 
rectly identified by qualitative exami- 
nation. However, the misclassifica- 
tion is sufficiently small to make fur- 
ther research on this test imperative. 

Grassi's (24) attitude toward his 
own statistics is difficult to under- 
stand. He writes: "It cannot be too 
Strongly stressed that test scores 
taken alone will lead to incorrect 
conclusions. The score is intended as 
à guide, not as the sole criterion for 
the final conclusion. Behavior cannot 
be too greatly emphasized and should 

, in most cases, the basis for diag- 
Dosis. Test score and intellectual 
level are supporting factors. They 
must not, ever, be the sole basis for 
final classification" (24, p. 63, italics 
his. Two comments are rele- 
vant here. In the first place, for 
the standardization groups at least, 
Since there is no overlap, it is hard to 
ha how a quantitative score could 
ead to a false diagnosis of brain dam- 
age. Second, a subjective statement 

at the patient's behavior was quali- 
fatively such and such is in fact a 
oy quantitative statement 
"A havior, albeit in a crude and 
: Ccurate way. The aim of any test 
Onstructor must surely be to quan- 
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tify as soon as possible his initially 
crude and unrefined observations 
and thus set up proper control, Why 
Grassi, having done this, should then 
turn his back on it, is difficult to see. 
The consequences, however, are seri- 
ous, Although Grassi himself states 
that intelligence can influence per- 
formance, no adequate information is 
given. This would have been espe- 
cially valuable in view of the rela- 
tively large numbers of subjects used. 
In spite of these criticisms, the Grassi 
test represents an important advance 
over the original Goldstein tests. 
Clinical use of the Color Form test 
of the Goldstein battery has sug- 
gested that it is too easy to be useful 
in the detection of brain damage 
except in severe cases. This was con- 
firmed by McFie and Piercy (43) who 
used 55 cases of known brain damage 
and scored the test simply as "pass" 
or “fail” on the basis of the patient's 
ability to sort the pieces both by form 
and by color. They found that only 
9 of the 55 patients failed the test. 
An important modification of the 
Color Form test has, however, been 
published by Scheerer (60). Three 
large model figures—a circle, a tri- 
angle, and a square—are used, and 
there are 12 test figures, four being 
subsumed under the concept of 
roundness, four under the concept of 
triangularity, and four under that of 
squareness. The three large figures 
lie on the table as model figures. The 
subject is shown each of the remain- 
ing figures in turn and asked to indi- 
cate with which model figure it be- 
longs. Each figure is removed from 
the subject's sight before the next 
one is presented. A simple, objective 
system of scoring has been devised, 
failure being counted when the sub- 
ject incorrectly assigns a figure. If 
this happens, a series of three “helps” 
is given, similar in nature to the suc- 
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cessive steps on the Block Design 
test. The test was standardized on 
four groups of subjects. The first 
group consisted of 44 college stu- 
dents; the second group was made up 
of 20 noncollege students; the third 
group consisted of 20 brain-damaged 
patients; and the fourth group in- 
cluded 20 retarded high school pupils 
who were used as controls for those 
brain-damaged patients of low intelli- 
gence. Scheerer found that the per- 
centage of subjects needing the three 
"helps" fell rapidly after the first 
help for the three non-brain-damaged 
groups, but that all the brain-dam- 
aged patients needed the third help 
and 75 per cent of them failed all 
three helps; no subjects in the other 
three groups failed all three helps. 
This study is of importance, not 
only for its success in identifying 
brain-damaged patients, but in repre- 
senting one of the few serious at- 
tempts at quantification of the Gold- 
stein-Scheerer tests. However, some 
criticisms are possible. The instruc- 
tions contained in the original article 
are inadequate for clinical use—for 
instance, it is not made clear what 
happens if the subject makes more 
than one error after the first help has 
been given; the test objects are pre- 
sented in a fixed, not random, order, 
so that a pattern of response may be 
set up which masks inability to do the 
test. In this connection it may be 
noted that the figures given by 
Scheerer indicate that, compared 
with the noncollege group, fewer of 
the retarded group needed the third 
help, and that, compared with the 
college group, fewer retarded needed 
the second help. Again, the test has 
not been given to a group of func- 
tional patients, so the degree of over- 
lap between such a group and the 
brain-damaged group is not known. 
As far as is known, no subsequent in- 
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formation has been made avail: 
this test, and no critical studies 
been published. 

Zangwill (in Buros, 10) has cri 
cized the Goldstein tests on the ge 
eral grounds that they are quali! 
that some psychotics show 
ment; that some brain-damag 
tients with traumatic injuries b 
essentially normally on the tests; a 
that gross impairment on the test 
due to a specific lesion, may b 
present in the absence of any occupi 
tional or social difficulties. It may! 
added that no information is | 
able concerning the factorial compos 
tion of the tests. P 


QUANTITATIVE TESTS OF — 
BRAIN DAMAGE 


Tests Employing the Concept of Deter 
oration 
The tests considered under 

heading are based on the prin 

that brain damage leads to det 

tion of an irreversible nature. 4 

deterioration usually is contr 

with that found in the functional 
orders, wherein deterioration is € 
sidered to be more apparent 
real, and to be due to inattention 
assumption being that the pa 
retains normal ability if he coul 
made to perform at his best le 
is not proposed to discuss the com 
of deterioration as such, and the 
tests will be criticized only in so far 
they claim to distinguish brain-c 
aged patients from other groups. 
The Hunt-Minnesota Test for © 
ganic Brain Damage (34) consists 
three major divisions—a vocabu 
test, which is relatively insensi! 
brain damage; a group of tests 
tive to deterioration; and a grou 
interpolated tests. The subje 

Stanford-Binet vocabulary scot 7 

relation to his age, determines 
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score level at which he is expected to 
perform the more sensitive tests. The 
deterioration tests, consisting of pairs 
of words and of designs that the sub- 
ject is expected to associate and later 
recall and recognize, determine the 
level at which he is actually function- 
ing. The amount of discrepancy 
between the subject's expected score 
and the score he actually makes on 
the word and design associations 
(when corrected for age) is the basis 
for the diagnosis of brain damage. 
The discrepancies are indicated by 
T scores; those T scores which fall 
higher than a certain critical point 
are considered to be indicative of 
brain damage. Originally, this criti- 
cal T score was fixed at 68; later it 
was reduced to 66. The test may be 
given in either a short or long form. 
, Two groups of subjects were used 
in the development of the test: 33 pa- 
tents suffering from brain damage 
(all but three were cases of diffuse 
damage); and 41 controls, consisting 
of 15 neurotics, 11 normals, 6 psy- 
chotics, and 8 nonpsychiatric pa- 
tients. All subjects had to be able to 
read and speak English, had attended 
School to third grade, were ade- 
quately cooperative and attentive, 
had adequate muscular coordination 
ànd sensory acuity, and were between 
the ages of 16 and 70. For discrimina- 
tive purposes, 25 persons equated for 
Vocabulary and age were drawn from 
ĉach group. Using a cutting point of 
68, Hunt found that only one of the 
x standardization subjects was mis- 
pen and that, of the remaining 
i Subjects, only three were mis- 
assified. (He ignored three subjects 
Who scored 68.) 
Keiner use of the interpolated tests 
Vas 3 of concentration and attention) 
ion pased on the following assump- 
N 8: deteriorated brain-damaged pa- 
"ts would fail the learning tests 
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but not the interpolated tests; func- 
tional, "apparently deteriorated" 
patients would succeed on both learn- 
ing and interpolated tests; functional, 
"genuinely deteriorated" patients 
would fail both learning and inter- 
polated tests. Implicitly, therefore, 
failure on the interpolated tests was 
to be utilized as a measure of whether 
a functional patient was really, or was 
only apparently, deteriorated. In 
fact, this ingenious hypothesis was 
not directly tested by Hunt; his only 
finding was that both of the groups 
he used were not differentiated by the 
interpolated tests. 

While, in its construction and 
standardization, this test fulfills to 
some extent the first two criteria laid 
down earlier in this paper, the results 
reported by Hunt have not been 
confirmed by other workers, while 
important reservations in the use of 
the Hunt-Minnesota vocabulary test 
are necessary. 

The discrimination between nor- 
mals and organics has not been found 
by other investigators. Thus, Aita 
and his associates (1), using groups 
already referred to in the study by 
Armitage (4), found that 10 per cent 
of their control subjects obtained a 
T score greater than 90, but that only 
2.2 per cent of their brain-damaged 
subjects obtained a score as high as 
this. On the other hand, 41.3 per 
cent of their brain-damaged subjects 
obtained a score that was in the nor- 
malrange, i.e., below 70. Many of the 
brain-damaged patients actually 
scored well below the cutoff point 
used by Hunt, 23.9 per cent of them 
falling within the range 50-54, When 
the brain-damaged group was di- 
vided into three categories— severe, 
moderate, and mild—only the mean 
T score of the severe group was sig- 
nificantly different from that of the 
controls. » 
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A study by Canter (11) reinforced 
this finding that the amount of over- 
lap between a brain-damaged group 
and a normal group may be consider- 
able. For 47 arteriosclerotic patients 
the mean T score was 69.04 (SD 
=12.49)—that is, the group mean 
fell within normal limits. Malamud 
(45), in a preliminary study, found 
that six out of ten members of the 
psychology department at her hos- 
pital were suffering from brain dam- 
age according to results on this test! 
She therefore administered the test 
to 64 subjects who satisfied Hunt's 
basic criteria. She found that 54.7 
per cent of these normal subjects ob- 
tained scores indicating organic brain 
damage. The exact critical score used 
did not influence the results appreci- 
ably. When all vocabulary scores 
above 33 words or more were arbi- 
trarily reduced to 32, no less than 
42.2 per cent of the total group still 
fell within the pathological range. 

In addition to this overlap between 
organics and normals, it has been 
shown by Meehl and Jeffery (47) 
that some functionals at least may 
show pathological scores on this test. 
They gave the test to a group of 15 
patients with functional depressions, 
of whom 9 were psychotic and 6 
neurotic. The possibility of brain 
damage was ruled out in all cases, and 
11 of the subjects did not fail any of 
the interpolated tests. The mean 
T score of the entire group was 70.2 
(SD=15.41). Using a critical score 
of 70, they found that one in four 
functionally depressed patients would 
obtain an organic score; using a cut- 
off point of 66, the ratio would be one 
in three. 

We have already seen that Mala- 
mud found that a high vocabulary 
score did not markedly affect the pro- 
portion of normals showing a patho- 
logical score on this test. However, a 
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showed that when the test w. 
to a group of 50 college studen 
reached Superior Adult III | 
the Binet vocabulary test, the 
tained a mean T score of 
whereas Hunt's control-group 
was 50. Only 3 per cent of Hum 
control group obtained a score 
than this mean score. Of this coll 


point. Juckem and Wold com 
that the test yields far too 
false positives among persons of 
vocabulary level. If a normal T. 
were to be obtained, the vocabt 
level would have to be reduced 
words. 1 
These studies show that, in pra 
tice, the test has not lived up to 
claims made for it by its author, 
its usefulness as a test of brain 
age is very much in doubt. ; 
The Shipley-Hartford Re 
Scale (64) is a short test of two 
one consisting of a vocabulary 
which is supposed to hold up 
age or illness; the other, of a set 
abstract reasoning problems, 
posedly sensitive to deterioration. 
Conceptual Quotient (CQ) is deriv 
from scores on the two parts of x 
test and may be defined roughly 88 
Mental Age (Abstractions 
/Mental Age (Vocabulary T€ 
X100. Mental age norms were es 
lished on 1,046 young subjects. 
had been given a group intelli 
test. It was assumed that nom 
persons, regardless of native inti 
gence, would approach CQ's ot 
This assumption proved to be ro 
true within normal limits. A 
above 90 was considered normaly 
between 76 and 89, suspicious; 8! 
CQ below 75, pathological. The! 
was validated on 171 state 
cases and 203 private hospital 
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Shipley and Burlingame (65) claimed 
that the test would discriminate be- 
tween normals, schizophrenics, and 
organics, and it is for this reason that 
the test is included here. They found 
that organics did worst, that neu- 
rotics and normals fell close together, 
and that the psychotics came in be- 
tween. 

This test may be criticized severely 
on many grounds. The standardiza- 
tion data were completely inade- 
quate, only young intelligent normals 
being used in the construction of the 
scale. There was no control for age, 
sex, or intelligence, yet the factor of 
age is of particular importance in this 
test. Subsequent research has shown 
beyond doubt that the test does not 
discriminate between organic and 
other groups without considerable 
overlap. 

Thus, Aita and his colleagues (1) 
found that, of a control group of 61 
normal subjects, 47 per cent ob- 
tained a CQ that was suspicious or 
pathological, and 26 per cent of these 
61 subjects obtained a definitely 
pathological score. Of a brain-dam- 
aged group of 70 patients, 33 per cent 
fell within the normal range of scores. 
Contrary to Shipley's assertion, they 
Íound that mild or moderate neurosis 
lowers the CQ, although they present 
no figures. Canter (11) used the 
Shipley-Hartford scale on 47 cases of 
Multiple sclerosis and obtained a 
mean CQ score of 85.8 (SD =14.31). 


. Despite the unfavorable difference 


In age between his group (mean age, 
32) and the standardization group of 
Shipley, he found that 50 per cent of 
B cases obtained CQ's greater than 
6, and 40 per cent obtained CQ's 
greater than 90, indicating considera- 
le overlap with normals. 
: Magaret and Simpson (44) gave 
e test to 50 patients consisting of 
PSychotics and neurotics, aged 40-49, 


367 


and found a mean CQ of 74 (SD 
12.7), with a range of scores from 55 
to 105. Of the 50 cases, 29 obtained 
a CQ lower than 70. Here again, al- 
though age would be expected to 
work in favor of lower scores, the 
overlap with normals would still be 
considerable, while the overlap with 
organics would be even greater. Fur- 
thermore, correlations of the CQ with 
psychiatric ratings of deterioration 
and with the Wechsler deterioration 
index were not significantly greater 
than zero. These results are the 
more remarkable in that Garfield 
and Fey (21) showed that, in fact, 
the CQ did decline steeply, solely as a 
function of age. Their results other- 
wise agree with those of Magaret 
and Simpson: using a group of 100 
patients (58 psychotic, 37 neurotic, 
13 unclassified), they found a mean 
CQ of 79.1 for the psychotics and 
85.2 for the neurotics. 

A further study by Manson and 
Grayson (46) was even more dis- 
turbing. Of Shipley's standardiza- 
tion group, 26 per cent obtained a 
CQ below 90. Of a group of 1,262 
military prisoners examined by these 
authors, no less than 59.1 per cent 
obtained a CQ less than 90; 34.6 per 
cent a CQ less than 80; and 14.2 per 
cent a CQ less than 70; the mean CQ 
for the group was only 88.8. They 
suggested that the reason for this 
discrepancy was a spuriously high 
vocabulary score, and showed that 
substitution of the Army General 
Classification Scores for the vocabu- 
lary scores produced figures very 
similar to those of Shipley. This con- 
firms the previous findings that spu- 
riously organic scores may beobtained 
on tests which use vocabulary level 
as a measure of previous intellectual 
functioning. Manson and Grayson 
also report that 45 per cent of severe 
neurotics scored below 80 on this 
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test. This was confirmed by the 
work of Kobler (39), who reported 
that, of 500 neurotics at an army re- 
habilitation center for combat fa- 
tigue, 78 per cent made CQ's below 
90; 26 per cent had CQ's below 70 
and thus fell within the organic range. 

Fleming (20) reported that 20 de- 
pressives obtained a mean CQ of 
only 73.2. However, the mean age of 
his group was 57.6 years. Ross and 
McNaughten (56) used 90 subjects 
with head injury and found that the 
results bore no relation to severity of 
the injury or to EEG or pneumoen- 
cephalographic evidence of cerebral 
damage except in cases of severe in- 
jury. No statistical data were given, 
however. 

How far the original faulty stand- 
ardization of the test and lack of care 
in its construction have been respon- 
sible for the failureof subsequent work- 
ers to establish the validity of the test 
is difficult to estimate. The findings 
quoted, however, certainly indicate 
that the test has not been adequately 
validated. 

There have been a number of at- 
tempts to develop indices of dete- 
rioration by using the Wechsler sub- 
tests in various combinations. The 
rationale of these tests is well known. 
A ratio is calculated between those 
tests which are said to hold up with 
age, and those which do not hold up 
with age. The underlying assump- 
tion is the rather curious one that or- 
ganic deterioration is similar to the 
deterioration accompanying age, dif- 
fering only in its early onset. Wechs- 
ler (68) found that the index dis- 
criminated between young normals 
and young brain-damaged patients, 
the percentage overlap, however, 
being high. Thus, there was a re- 
striction on the usefulness of the test 
right from the start. Levi, Oppen- 
heim, and Wechsler (40) specifically 
claimed that the DI was useful, not 
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only in confirming, but also in dis 
covering, organic conditions. They 
claimed that it would assist in differ 
entiating organic memory impair 
ment from hysterical amnesia; im 
finding corroborative evidence of or 
ganic involvement where clinical and 
neurological data are not clear-cut; 
in distinguishing between mental 
deficiency and mental deterioration; 
and in differentiating between psy- 
chosis with and without organic de 
terioration. With the exception of 
the first, these claims have been 
tested and invariably have beem 
found wanting; and it seems safe to 
say that the first has not been found 
wanting only because it has not been 
tested. 

Two important modifications of 
the Wechsler DI are Reynell's index 
(52), which makes use only of the 
verbal subtests, and Hewson's de- 
viation ratios (31), which are 
on various combinations of the sub- 
tests. Reynell’s index, however, has 
been incorrectly criticized by some 
workers—its intention was not to 
discriminate brain-damaged cases 
from others, but to discriminate those. 
brain-damaged cases with deteriora 
tion from those without (and hence 
assumed initial knowledge of brait 
damage). 

Many studies have now been I" 
lished on the DI. This paper 1$ CO* $ 
cerned only with those investigating 
the validity of the tests as a measure 
of brain damage. At least six studies 
concur in reporting unfavorably 0n 
its use. Thus Gutman (25), using 30 
organics and 30 controls, found E 
the Wechsler DI correctly identifi 1 
only 43 per cent of organics, 
Reynell index 50 per cent, and | 
Hewson ratios 60 per cent; whereas 
the DI misclassified 33 per cent of p. 
normal group, the Reynell index "A 
per cent, and the Hewson ratios 17 pe 
cent. The three measures agr i 
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the diagnosis of brain damage in only 
M per cent of the cases. Five cases 
of clinically verified brain damage 
did not fall in the organic range on 
any of the tests. Allen (2), using as 
his criterion of deficit a loss greater 
than 20 per cent, found that the 
Wechsler DI definitely screened out 
only 54 per cent of the total study 
group of 50 patients, Rogers (53) 
evaluated the DJ for seven groups 
(349 subjects) and found that, using 
a cutting score of 10 per cent, 75 per 
cent of subjects will be correctly 
identified, provided that only the 
brain-damaged and normal groups 
are used; but that, when other clin- 
ical groups are included, the results 
are no better than chance. Andersen 
(3), using 55 male soldiers with defi- 
nite clinical evidence of brain dam- 
age, showed that, when a cutting 
score of 10 per cent was used, nearly 
one-third of the total sample fell out- 
side the organic range; yet when a 
cutting score of 20 per cent was used, 
nearly two-thirds of the patients fell 
inside the normal range. He divided 
his group of patients into those suf- 
fering predominantly from injury to 
the dominant hemisphere and those 
suffering from injury to the nondom- 
mant hemisphere. This did not ma- 
terially improve the results. Kass 
(38) gave the test to 18 cases with 
known organic damage and 12 cases 
of dubious organic diagnosis, and 
concluded that the DJ failed both in 
detecting and confirming the presence 
= organic conditions resulting largely 
Tom traumatic brain injury. As a 
Percentage-loss method for express- 
ing psychological deficit, it was found 
applicable in two-thirds of his 
ym Diers and Brown (15), using 
d Umm of multiple sclerosis, con- 
i ed that the DI was not sensitive 
an to be used clinically. Garfield 
s Fey (21) found that an equal 
umber of psychotic and nonpsy- 
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chotic patients obtained pathologi- 
cally high DJ's, suggesting that the 
overlap between organics and func- 
tionals would be quite high. Magaret 
and Simpson (44) found that the DI 
rating did not correlate with the psy- 
chiatrist's ratings of degree of dete- 
rioration. The only study so far to 
produce reasonably favorable results 
with the DI was that by McFie and 
Piercy (43). Using 56 brain-damaged 
patients and a cutting score of 10 
per cent, they were able to identify 
43 (71 per cent) of them; using a cut- 
ting score of 20 per cent, they iden- 
tified 37 (66 per cent). No functional 
patients were tested. In view of the 
unfavorable results summarized 
above, it seems clear that indices of 
deterioration are of little clinical use 
in their present form. 

Halstead (28), using factorial an- 
alysis and various systems of weight- 
ing, developed a battery of tests 
which discriminated at a high level 
of confidence between normals and 
patients with lesions of the frontal 
lobes. The ten tests having the high- 
est t value were selected as the basis 
of an impairment index. In this ar- 
rangement, an individual whose 
scores fell below the criterion scores 
on all ten of the key tests had an im- 
pairment index of 0.0; while, on a 
simple proportion basis, an individ- 
ual who satisfied the criterion 
score on three of ten key tests had an 
impairment index of 0.3; or on all of 
the key tests, an index of 1.0. Using 
a cutting score of three, he was able 
to identify all 27 cases of frontal lobe 
injury and 29 out of 30 normals. The 
impairment index did not discrimi- 
nate between normals and other cases 
of brain damage. 

Of the ten tests, the Halstead 
Category Test (28, 29), involving the 
ability of the subject to “abstract” 
various organizing principles such as 
“size,” “shape,” “color,” etc. from a 
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series of 336 stimulus figures pre- 
sented visually and serially by means 
of a multiple-choice projection ap- 
paratus, proved particularly success- 
ful. Using a cutting score of 70, he 
correctly identified 27 out of 29 nor- 
mals and 10 out of 11 cases with fron- 
tal lobe injuries. 

When, therefore, the patient is 
known on other grounds to be neither 
psychotic nor neurotic, this battery 
of tests offers a very accurate indica- 
tion of whether or not the lesion is 
situated in the frontal lobes. The im- 
pairment index was validated on a 
group different from the standardiza- 
tion group and was repeated on an in- 
dependent group. The only obvious 
objection to the index is the inade- 
quate representation of groups other 
than normals or brain-damaged. 


Perceptual and Motor Tests of Brain 
Damage 


Many attempts have been made 
to use the Rorschach test for the 
diagnosis of brain damage, The ap- 
proach has usually been made in 
terms of signs specific to brain dam- 
age. The approach is basically simi- 
lar to that used in tests employing 
the concept of deterioration, for the 
signs are taken to indicate inadequate 
or lowered performance. One of the 
earliest approaches was made by 
Piotrowski (49, 50), who used 33 
records, consisting of 18 brain-dam- 
aged cases, 10 cases with noncerebral 
disturbance of the central nervous 
system, and 5 cases of conversion 
hysteria. Ten signs were selected as 
differentiating between the three 
groups. These signs were very care- 
fully defined by Piotrowski. The or- 
ganic group produced a mean of 6.2 
signs; the other groups, a mean of 
1.5 signs; there was no overlap be- 
tween the groups. Thus, it was con- 
sidered that the presence of five or 
more of the Piotrowski signs was in- 
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dicative of brain damage. In a later 
paper (51), Piotrowski showed that 
the number of signs produced was a 
function, in part, of the severity of 
the personality changes produced by 
the disorder, and that the number of 
signs also increased with age. He 
also pointed out that some of his 
signs were produced by schizophren- 
ics and neurotics. He claimed that, 
out of 56 patients producing five or 
more signs, 55 were in fact organic; 
of 25 patients originally considered, 
but later rejected as such by neuro- 
logical tests, only one produced an 
organic record. 

Ross (54) tried to repeat the find- 
ings of Piotrowski. He used two 
groups which closely approximated 
those used by Piotrowski. He also 
tested several other groups and found 
that the signs occurred highly signifi- 
cantly more often in the group with 
cerebral lesions, and significantly 
more often in the epileptics, than 
in all the other groups together, How- 
ever, although five or more of these 
signs were found most often in pa- 
tients with disease of the cerebral 
cortex and subcortical tissue, they 
were not specific for these lesions. 
Thus, 55 per cent of brain-damaged 
patients showed five or more signs, 
but so did 30 per cent with noncor- 
tical lesions of the central nervous 
system, and also 20 per cent of the 
psychotics and 14 per cent of the 
neurotics. Furthermore, Ross showed 
that 50 per cent of the cortical cases 
and 53 per cent of the epileptic cases 
showed five or more of the Harrower- 
Erickson signs of neurosis. 

In two later papers (55, 57), Ross 
divided the signs into four groups: 
those common to neurotic and of 
ganic patients; the neurotic differen- 
tial signs; the organic differenti 
signs; and the organic excluding 
signs. Each sign was then weight: 
in rough proportion to its differentia 
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incidence in the groups of individuals 
being compared. The four sets of 
scores were then combined for each 
individual to give two ratings. These 
two ratings were called the “instabil- 
ity" and "disability" ratings, and 
were standardized on 50 neurotics, 24 
organics, 50 superior, and 50 average 
normals, Thisrepresented an advance 
on Piotrowski's method, in that a pa- 
tient could be given a rating on both 
the organic and the neurotic dimen- 
sions. 

The next important contribution 
came from Hughes (32, 33), who 
derived 14 signs from a factor an- 
alysis of the 22 signs he found in the 
literature on brain damage. Differ- 
ent weights were assigned to these 
signs according to their discrimina- 
tory power. He used 218 subjects, 
including 50 with brain damage, 68 
schizophrenics, 74 neurotics, 4 manic- 
depressives, and 22 normals. The 
point-biserial correlation between 
presence or absence of brain damage 
and score on the sign pattern was 
+.79, which was highly significant. 
When a cutting score of seven or 
above was used, 82 per cent of the 
Organics were correctly identified, 
while only 1 per cent of nonorganics 
Were falsely identified as organic. 
Using Piotrowski's signs, Hughes 
could correctly identify only 20 per 
cent of the organics without includ- 
Ing many nonorganics. 

However, these encouraging re- 
sults were shown to have a serious 
xd by the recent work of Diers and 
Sale (16). They took a group of 

multiple sclerotics and divided 
10 al into two groups, 15 with an 
puo 102, and 10 with an IQ be- 
full 02. This latter group was care- 
uly matched with 11 nonorganic 
patenta with IQ's below 102. The 
: ree groups were then tested and the 
€sults analyzed for Hughes's signs. 

e mean number of signs for the 
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more intelligent organics was only 
0.33, none obtaining a score above 
seven, and ten obtaining a zero or 
negative score. As opposed to this 
the organics of average or dull IQ ob- 
tained a mean of 5.2 signs, and the 
nonorganics of average or dull IQ 
obtained a mean of 4.18 signs. None 
of the patients in these groups ob- 
tained a negative score. Diers and 
Brown concluded that there is an 
inverse relationship between IQ meas- 
urement on the Wechsler and the 
weighted Hughes score, independent 
of the factor of intracranial pathol- 
ogy. The Hughes signs are, there- 
fore, invalid, unless obtained with a 
patient of high intelligence—and 
the study showed that, in fact, none 
of the intelligent organics used by 
Diers and Brown fell within the or- 
ganic range. 

A new approach was made to the 
problem by Dörken and Kral (17), 
who, instead of asking what signs 
the organic patient would show on 
the Rorschach, investigated what 
signs he would be likely not to show. 
Seven signs were determined, and 
each sign was weighted according to 
degree of occurrence in organic or 
nonorganic states. By this means, 
a total possible score of ten was de- 
termined, In terms of the frequency 
distribution, the cutting point was 
fixed between two and three; that is, 
scores from three to ten, inclusive, 
exclude a diagnosis of brain damage. 
Using this method, they claimed that 
92.9 per cent of organics were identi- 
fied, and 83.3 per cent of nonorgan- 
ics were identified as such. If Pio- 
trowski's signs were used however, 
only 50 per cent of the organics would 
be identified; while use of the Ross 
“disability” ratio identified 75 per 
cent of the organics. 

One additional study may be men- 
tioned here. Buhler, Buhler, and Le- 
fever (8), using,30 normals, 70 neu- 
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rotics, 50 psychopaths, 27 schizo- 
phrenics, and 30 organics, developed 
à Basic Rorschach Score. They 
claimed that this score was capable 
of separating clinical groups in a 
statistically reliable manner. But 
they also stated that 


the variability of scores within each clinical 
group indicates that the Basic Rorschach 
Score alone is not a sufficient basis for indi- 
vidual diagnosis. The placement of the in- 
dividual case on a scale of adjustment or ego- 
integration does, however, appear to be a 
highly probable outcome of the use of the 
Basic Rorschach Score (8, p. 161). 


In other words, the technique is de- 
signed, not to discriminate between 
organic and other groups as such, 
but rather to give an estimate of the 
degree of mental illness—a procedure 
which is based, apparently, on the 
theory that persons vary on a single 
dimension from normal through neu- 
rotic to psychotic, and which has 
been criticized by Eysenck (18, 19). 
A more recent monograph by Buhler, 
Lefever, Kallstedt, and Peak (9) con- 
firmed the results obtained in the 
initial study. 

The major criticisms to be made 
of the Rorschach as a test of brain 
damage seem to be that in all the 
scoring systems, except that of Dér- 
ken and Kral, the greatest weighting 
is given to those factors that are the 
most difficult to score and that de- 
pend, therefore, to the highest ex- 
tent on the subjective evaluation of 
the examiner. Second, although all 
authors using the sign approach seem 
to obtain adequate differentiation of 
groups for their particular study, the 
differentiating power invariably 
drops considerably when these meth- 
ods are repeated by other workers. 
Thus, where Piotrowski claimed pow- 
erful discrimination for his signs, 
Dérken and Kral found that they 
identified only 50 per cent of their 
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organic group; and other groups 
signs used by Ross declined to ane 
ciency of 75 per cent. Third, most 
the studies have ignored the inf 
ence of age and intelligence; th 
classic example of this is, of co 


sight seemed so promising. We must 
conclude, therefore, that, while mo 
workers in this field report satisfac 
tory discrimination, the constant“ 
eat dog" method by which one s 
of signs is set aside as unusable and 
replaced by a new set by subsequent 
workers does not inspire confident 
in the validity of the most n 
method, that of Dérken and 1 
That the Rorschach offers distinc 
promise in this problem cannot 
denied; that it has been shown to b 
a satisfactory test of brain damage 
open to question. 

Several tests have been recentl 
constructed which are not very 
known. Two of these are describe 
in the monograph by Armitage 
In their original form, these tests 
known as the Trail Making Test 
the Patch Test. The Trail Makin 
Test consists of two parts. Part 4 
consists of a sheet of paper on wh 
there is printed a series of circles. 
the center of each of these is a num 
ber, with a range for the test pro 
of 1-25, and for the sample, which i 
on the opposite side of the page, 
1-7. "These circles are spatially at 
ranged in a random order. The p: 
tient is required to draw a line b 
tween the circles, starting at numb 
1 and finishing at number 25. Hi 
asked to work as rapidly as possible 
and to erase errors. Part B is ver 
similar, but more complex, in 
the numbered circles are mixed 
circles lettered from A onwards. 
task is to draw a line from circle 
to circle A; then to go to circle t 
then to circle B, and so on. Scormg 
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is in terms of time and accuracy. If 
no errors are made, or if an error 
is corrected very quickly, the score 
obtained depends directly on the 
speed with which the test is com- 
pleted. If the item is accomplished in 
less than 20 seconds, a maximum 
score of ten is given; however, if 
longer times are taken, partial cred- 
its are given. The Patch Test re- 
quires the duplication of nine col- 
ored circular patterns, one serving as 
à demonstration design. For every 
pattern, the materials provided con- 
sist of 19 paper circles of different 
colors. Some of these are solid, while 
others have different sections of the 
interior part removed so that, by 
placing them on top of one another, a 
pattern may be formed. The test 
materials are arranged in front of the 
patient in a standard order. The 
eight test designs are in graded order 
of difficulty, and the test is termi- 
nated after two failures. Each design, 
with one exception, can be duplicated 
only by putting together specific 
pieces out of the 19 provided. In the 
standardization of these tests, Armi- 
tage used as subjects 44 patients 
known to have sustained brain dam- 
age (9 focal, 17 focal-diffuse, 18 dif- 
fuse). The control group consisted of 
45 normal subjects and 16 mild 
neurotics. The groups were consid- 
ered roughly comparable as to age, 
level of education, and pre-injury 
Occupation. 

It was found that Trail Making 
Test A, the simplest of the tests, 
discriminated the  brain-damaged 
group best from the two control 
groups. With a cutting score of 10, 
the total misclassification was 16 out 
of 95, 32 of the 44 organics being 
positively identified. Trail Making 
Test B identified 39 out of 44 organ- 
Ics, but misclassified 15 out of 51 
Normals; and a combined score was 
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not more effective. As a clinical in- 
strument, the Patch Test identified 
26 out of 43 organics, misclassifying 
7 out of 51 controls, The tests would 
appear to be useful ones and merit 
further study. Unfortunately, no 
data are given on the performance 
of psychotic patients, and the study 
has not been repeated. 

The genesis and construction of 
the Block Design Rotation Test has 
been described in three articles by 
Shapiro (61, 62, 63). This test re- 
sulted from the observation that 
some patients, while doing the Gold- 
stein block design test, left the com- 
pleted design in a rotated position 
compared with the test figure. This 
rotation might be as great as 45°, but 
rarely exceeded it. Various hypothe- 
ses were examined and it was found 
that the rotation effect could be 
maximized when three factors were 
interrelated in a certain way— 
namely, the factors of figure shape, 
ground shape, and angle of the line 
of symmetry (the line of symmetry 
being defined as the line which di- 
vided a design into mirrored halves). 
Thus, rotation was found to be max- 
imal when the figure and ground 
shapes were in a diamond orientation 
and the line of symmetry was at an 
angle. 

From various theoretical and em- 
pirical considerations, it was de- 
duced that the rotation effect would 
be maximized under these conditions 
in brain-damaged patients. Accord- 
ingly, a special set of cards was de- 
vised, 40 in all. Each design could be 
reproduced by using four Kohs blocks. 
Two groups of subjects, 19 brain- 
damaged and 19 psychiatric patients 
(the latter without brain damage), 
were carefully matched for age and 
sex, and given the test under iden- 
tical conditions. Results were ac- 
cording to prediction, the brain- 
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damaged patients rotating an aver- 
age of 8? per card, while the func- 
tional patients rotated an average of 
only about 2? per card. Considering 
the matched pairs together, every 
brain-damaged patient rotated more 
than the corresponding functional 
patient, with one exception—and 
this patient's diagnosis was later 
changed independently to one of 
compensation neurosis. With a cut- 
ting score of 6? rotation per card, the 
test identified 14 out of the 19 brain- 
damaged patients, but misclassified 
only 1 functional patient. Nearly all 
the functional patients rotated a 
total of less than 200? for the 40 cards, 
and only one rotated more than 
250°. Many brain-damaged pa- 
tients, on the other hand, rotated 
more than 400? or 500? for the 40 
cards, and much higher totals have 
been encountered in clinical practice. 

When a further group of 19 brain- 
damaged patients (this time consist- 
ing of patients who had actually been 
operated) was tested, a closely simi- 
lar pattern of results was found. 
Data on several normal control 
groups showed that the test also dis- 
criminated between normals and 
brain-damaged patients, and revealed 
the curious and unexpected fact that, 
as a group, the controls rotated sig- 
nificantly more than the functionals 
(though this was significant at the 
.05 level only). The effects of age, 
sex, and intelligence have also been 
calculated and will be published 
shortly. Thus, the rotation test satis- 
fies the criteria laid down in this 
paper in that it provides data on at 
least three groups of subjects, it takes 
into account factors such as intelli- 
gence, age, and sex, and its results 
have subsequently been confirmed on 
new, independent groups, It is also 
an entirely objectively scored test, 
the amount of rotation being re- 
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corded (in the later studies) photo- 
graphically, with reference to a con- 
stant base line. 

Using the original two groups of 
subjects, Shapiro (unpublished data) 
also found that the manual dexterity 
test and the finger dexterity test of 
the U. S. Employment Service battery 
of tests discriminated very signifi- 
cantly between brain-damaged and 
functional patients. The former test 
identified 14 out of the 19 brain-dam- 
aged patients and misclassified 3 out 
of the 19 functionals; the latter iden- 
tified 12 out of the 19 brain-damaged 
patients and misclassified 5 out of the 
19 functionals. Used together, the 
tests successfully identified 16 of the 
brain-damaged patients. 


DISCUSSION 


It is doubtful whether any aspect 
of psychological testing has been 
more inadequately treated than the 
diagnostic assessment of brain dam- 
age. From a wide range of possible 
criticisms, some of the most obvious 
will be cited: 


a. In many instances, validation studies 
have not used comparable groups. For exam- 
ple, the majority of cases in Hunt's (34) brain- 
damaged group were paretics and others suf- 
fering from diffuse brain damage. In his vali- 
dation study, Armitage (4), on the other hand, 
included many patients suffering from trau- 
matic head injury as the result of penetrating 
missiles, There is no reason to suppose ko. 
these two groups were at all comparable wit 
regard to type, locus, or severity of brain 
damage. Armitage's procedure may be justi- 
fied, however, in so far as Hunt described his 
tests as a measure of brain damage without 
further qualification. s f 

b. Few authors, when constructing their 
tests, choose their cases with sufficient Care 
in most instances considering brain damage 4 
a unitary factor. That this is unsound from 
the point of view of diagnosis may be shown 
by evidence from many sources. Thus, if $ 
random group of brain-damaged patients 1 
given a test, the results nearly always a 
into an abnormal distribution (i.e, 
skewed). Many of the brain-damaged grouP 
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behave on a given test like normal controls or 
functional patients, while others obtain very 
abnormal scores. Frequently, this sort of 
pattern accounts for the significance of dif- 
ferences between the groups. This kind of 
distribution was encountered in the Block De- 
sign Rotation Test and was recently seen in a 
study by Battersby, Teuber, and Bender (6) 
on the behavior of brain-damaged patients in 
a problem-solving situation. From an ana- 
tomical and physiological standpoint, there 
is likewise no reason why all brain-damaged 
patients should belong together. As Penfield 
and Evans (48) pointed out, there is a wealth 
of difference between the brain damage result- 
ing from scar formation on the temporal lobe 
following an accident, and the scar formation 
resulting from a temporal lobectomy. Simi- 
larly, there is no reason to suppose that a 
leucotomy operation will have the same effect 
as diffuse brain damage. 

c. Many investigators neglect almost com- 
pletely the elementary necessity for evaluat- 
ing and controlling various relevant factors 
such as age, sex, and intelligence. The Ship- 
ley-Hartford Retreat Scale, the Rorschach, 
and the Goldstein test are noteworthy exam- 
ples where failure to do this has led to am- 
biguity in the clinical use of the test. That 
factors such as these cannot be neglected 
without serious risk of error is apparent from 
an article by Hebb (30). Using simple pat- 
terns that had to be reproduced with pieces 
of wood, Hebb found that “no pattern could 
be devised, which was so easy that all patients 
in the public wards of a general hospital could 
Succeed with it in one minute, even though 
other tests showed that one was not dealing 
with a population of mental defectives” (30, 
P. 16). Hebb concludes that, although this 
kind of material tends to be eliminated in tests 
which are adequately standardized, “in spe- 
un tests which have not been standardized,’ 
ü ia a real danger of assuming that a varia- 
M rum the norm, which is. frequently ob- 
ad or the normal population, can be due 

y to the effects of cerebral injury" (30, 
E In addition, many authors fail to 
^s € whether or not their subjects were tested 
Hei defects, even when this is clearly 
Rea The Trail Making and Patch tests, 

instance, might well be performed badly 

y normal persons with defective eyesight. 
“eines tests might be peculiarly successful in 
ahd soldiers suffering from penetrating 
ithe, 504 brain-damaged, and be completely 
ccessful in cases of diffuse brain damage. 
sub; any tests employ very unreliable and 
tic SPA scoring systems. This applies par- 
ularly to the Rorschach, where most weight 
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is given to factors such as impotence, per- 
plexity, etc., but also to the Hunt-Minnesota 
"Test, where, for example, more credit is given 
for a response within one-half of a second than 
for a response within one-half to one second. 
e. Occasionally, there is failure to realize 
that a test may discriminate between two 
groups statistically at a high level of signifi- 
cance, and yet still be unusable clinically be- 
cause the misclassification would be very high. 
f. In many tests, there is failure to control 
relevant variables. Thus, in the Hunt-Minne- 
sota test, it is claimed that immediate and 
long-term memory are being measured. Ex- 
amination of the items, however, reveals 
failure to control for differential learning abil- 


ity. 


These criticisms are made on 
methodological grounds; there seems 
no reason why they should not be 
overcome. Assuming that they were 
overcome, would it be possible to de- 
velop adequate tests of brain dam- 
age? The answer would appear to be 
in the negative, as long as a purely 
engineering approach is made. Thus 
a most painstaking study by Lynn, 
Levine, and Hewson (42) was con- 
cerned with the aftereffects of ex- 
posure to blast during the war. Such 
exposure might result in a closed 
head injury with accompanying resid- 
ual symptomatology determined by 
the brain damage alone; or in a neu- 
rotic syndrome, identical clinically 
to that attributable to brain damage, 
but without any actual trauma to the 
brain; or (most commonly) in a com- 
bination of the two. Starting with 
over 4,000 patients, they reduced 
this group to 81 "pure" cases and 
attempted to differentiate these by 
means of a battery of tests. Elabor- 
ate precautions were taken to mini- 
mize unreliability of diagnosis and 
to control for age, sex, intelligence, 
etc., and a large number of independ- 
ent validating groups were used. 
Nevertheless, for these “pure” groups 
the misclassification was still 12 per 
cent. For the validation groups, itis 
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arguable that the statistical tech- 
niques were inadequate, but the re- 
sults are still disappointing. 

The basic flaw in most of the tests 
discussed above lies, however, in the 
theoretical approach. Examination 
of the tests suggests that one major 
hypothesis concerning the psycho- 
logical effects of brain damage per- 
meates the work; this is, that brain 
damage results in deterioration of a 
relatively permanent nature. Such a 
theory is unlikely to provide a satis- 
factory basis for the construction of 
tests of brain damage because it is 
not exclusive to brain damage. This 
criticism seems more important than 
the usual one that deterioration is 
an ill-defined concept—so much so 
that Hunt (35) preferred the neutral 
term "psychological deficit." Simi- 
larly, provided the test is rigorously 
scored, the assumption that vocabu- 
lary level is resistant to the effects of 
mental illness has been shown to be 
false by the work of Yacorzynski (69), 
Capps (12), and Simmins (66); while 
Crown (14), using 4 cases of myx- 
edema (which causes apparent intel- 
lectual deterioration) tested before 
and after treatment, found that the 
Shipley vocabulary MA of these pa- 
tients rose by an average of 10.2 
months following treatment by thy- 
roxine, and on another vocabulary 
test, the mean vocabulary IQ rose by 
6and 14 points in two of the cases. 

The first essential in the construc- 
tion of tests of brain damage, there- 
fore, is the development of a theory 
which is exclusive to brain damage. 
One possible theory has been pre- 
sented by Shapiro (61, 62, 63) in 
his articles, and may be stated as 
follows: Brain damage results in the 
development of states of exaggerated 
inhibition. Such a theory is more sat- 
isfactory than a theory of deterio- 
ration (in so far as it is supported 
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by experimental evidence) because 
it would not be postulated to account 
for functional disorder or for the be- 
havior of normals. Hence, it becomes 
possible to make deductions and set 
up experimental situations in which 
brain-damaged patients behave dif- 
ferently from others, and through 
which verification of the theory and 
development of tests of brain damage 
are feasible. Having done this and 
verified the theory in its broad out- 
lines, it now becomes possible to 
consider alternative hypotheses and 
also to consider why some brain- 
damaged patients do not behave as 
predicted. One immediate possibil- 
ity, of course, is location of the dam- 
age. Halstead (28) and Rylander 
(58, 59) concur in demonstrating that 
it appears to be much easier to dis- 
tinguish brain damage in the frontal 
lobes from that in other areas of the 
brain. Another possibility is the sig- 
nificance of previous personality. 
The interaction of personality struc- 
ture and the effects of brain damage 
have been largely ignored by most 
investigators, although it is known 
that extensive brain damage may 
have little or no effect on functioning, 
and that a brain-damaged patient 
may be neurotic or psychotic. Now 
Eysenck (18) has shown that there 
are tests which discriminate between 
psychotics and normals but not be- 
tween neurotics and normals, am 
vice versa; he has argued, on this 
basis, that neuroticism and psychot- 
icism are orthogonal (independent) 
factors. At this stage, therefore, it 
would be possible to construct à bat- 
tery of tests which would give a pat 
ticular patient a factor score on eat 
of these three dimensions—psychot* 
icism, neuroticism, and brain dam- 
age. The question to be answer 

would then become, not, is the Pa 
tient psychotic or neurotic or brain 


ed, but what is his perform- 
e with respect to these three vari- 
is, and what is their interrelation- 

At this point, the internal valid- 
of the tests would become im- 
t since it would be necessary 
}show that those tests measuring 
je effects of brain damage have sig- 
icantly positive intercorrelations 
y themselves, but not with the 
ts of psychoticism or neuroticism. 
is clear, therefore, that the prob- 
lis much more complex than was 
gested in the early part of this 
In the view of the writer, a 
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purely empirical approach is unlikely 
to yield satisfactory results, nor is an 
approach based on a theory which 
has not been adequately tested ex- 
perimentally. A satisfactory test of 
brain damage should be based on a 
reasonable theory that has been ex- 
perimentally tested, has been sup- 
ported by adequate statistical treat- 
ment, and has taken into account all 
relevant variables. Such an ap- 
proach would at least help to over- 
come the impasse which seems to 
have been reached with many of the 
tests reviewed above. 
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Many social scientists other than 
psychologists try to account for the 
behavior of individuals. Economists 
and a few psychologists have pro- 
duced a large body of theory and a 
few experiments that deal with indi- 
vidual decision making. The kind of 
decision making with which this body 
of theory deals is as follows: given 
two states, A and B, into either one of 
which an individual may put himself, 
the individual chooses A in prefer- 
ence to B (or vice versa). For in- 
stance, a child standing in front of a 
candy counter may be considering 
two states. In state A the child has 
$0.25 and no candy. In state B the 
child has $0.15 and a ten-cent candy 
bar. The economic theory of decision 
making is a theory about how to pre- 
dict such decisions. 

Economic theorists have been con- 
cerned with this problem since the 
days of Jeremy Bentham (1748- 
1832). In recent years the develop- 
ment of the economic theory of con- 
sumer's decision making (or, as the 


1 This work was supported by Contract 
N5ori-166, Task Order I, between the Office 
of Naval Research and The Johns Hopkins 
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while I was writing this paper. M. Allais, 
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May, A. Papandreou, L. J. Savage, and es- 
pecially C. H. Coombs have kindly made 
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A number of psychologists, economists, and 
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sometimes unheeded, criticism. Especially 
helpful were C. Christ, C. H. Coombs, F. 
Mosteller, and L. J. Savage. 
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economists call it, the theory of. 
sumer's choice) has become exc 
ingly elaborate, mathematical, 
voluminous. This literature is alm 
unknown to psychologists, in sp 
sporadic pleas in both psychologi 
(40, 84, 103, 104) and econom 
(101, 102, 123, 128, 199, 202) lit 
ture for greater communicati 
tween the disciplines. i 
The purpose of this paper is to 
view this theoretical literature, @ 
also the rapidly increasing numb 
psychological experiments (pet 
by both psychologists and 
mists) that are relevant to it. 
review will be divided into five 
tions: the theory of riskless ch 
the application of the theory of 
less choices to welfare economics, 
theory of risky choices, transiti 
decision making, and the th 
games and of statistical 
functions. Since this literature is 
familiar and relatively inaccessib 
most psychologists, and since I 
not find any thorough bibliogi 
on the theory of choice in the 
nomic literature, this paper mc! 
a rather extensive bibliography of tli 
literature since 1930. í 


Tur Tueory or RISKLESS CHO 


Economic man. x 
those theorists who have been € 


2 No complete review of this lite 
available. Kauder (105, 106) has 
very early history of utility theory. 
(180) and Viner (194) have revie 
literature up to approximately 1930. $ 
son's book (164) contains an illum 
mathematical exposition of some OF 
tent of this theory. Allen (6) explains t 
cept of indifference curves. Schultz (1 
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cerned with the theory of decision 
making is essentially an armchair 
method. They make assumptions, 
and from these assumptions they de- 
duce theorems which presumably can 
be tested, though it sometimes seems 
unlikely that the testing will ever 
occur. The most important set of 
assumptions made in the theory of 
riskless choices may be summarized 
by saying that it is assumed that the 
person who makes any decision to 
which the theory is applied is an 
economic man. 

What is an economic man like? He 
has three properties. (a) He is com- 
pletely informed. (b) He is infinitely 
sensitive. (c) He is rational. 

Complete information. Economic 
man is assumed to know not only 
what all the courses of action open to 
him are, but also what the outcome of 
any action will be. Later on, in the 
sections on the theory of risky choices 
and on the theory of games, this as- 
sumption will be relaxed somewhat. 
(For the results of attempts to in- 
troduce the possibility of learning 
into this picture, see 51, 77.) 

Infinite sensitivity. In most of the 
older work on choice, it is assumed 
that the alternatives available to an 
individual are continuous, infinitely 
divisible functions, that prices are 
infinitely divisible, and that economic 
man is infinitely sensitive. The only 
purpose of these assumptions is to 
make the functions that they lead to, 


———————————Ó 


Views the developments up to but not includ- 
ing the Hicks-Allen revolution from the point 
of view of demand theory. Hicks's book (87) 
is a complete and detailed exposition of most 
of the mathematical and economic content of 
the theory up to 1939. Samuelson (167) has 
reviewed the integrability problem and the re- 
ne preference approach. And Wold (204, 

5, 206) has summed up the mathematical 
content of the whole field for anyone who is 
E at home with axiom systems and 

ifferential equations. 
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continuous and differentiable. Stone 
(182) has recently shown that they 
can be abandoned with no serious 
changes in the theory of choice. 

Rationality. The crucial fact about 
economic man is that he is rational. 
This means two things: He can 
weakly order the states into which he 
can get, and he makes his choices so 
as to maximize something. 

Two things are required in order 
for economic man to be able to put all 
available states into a weak ordering. 
First, given any two states into which 
he can get, A and B, he must always 
be able to tell either that he prefers 
A to B, or that he prefers B to A, or 
that he is indifferent between them. 
If preference is operationally defined 
as choice, then it seems unthinkable 
that this requirement can ever be 
empirically violated. The second 
requirement for weak ordering, a 
more severe one, is that all prefer- 
ences must be transitive. If economic 
man prefers A to B and B to C, then 
he prefers A to C. Similarly, if he is 
indifferent between A and B and 
between B and C, then he is in- 
different between A and C. It is not 
obvious that transitivity will always 
hold for human choices, and experi- 
ments designed to find out whether 
or not it does will be described in the 
section on testing transitivity. 

The second requirement of ra- 
tionality, and in some ways the more 
important one, is that economic man 
must make his choices in such a way 
as to maximize something. This is 
the central principle of the theory of 
choice. In the theory of riskless 
choices, economic man has usually 
been assumed to maximize utility. In 
the theory of risky choices, he is as- 
sumed to maximize expected utility. 
In the literature on statistical de- 
cision making and the theory of 
games, various other fundamental 
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principles of decision making are 
considered, but they are all maximi- 
zation principles of one sort or an- 
other. 

The fundamental content of the 
notion of maximization is that eco- 
nomic man always chooses the best 
alternative from among those open 
to him, as he sees it. In more techni- 
cal language, the fact that economic 
man prefers A to B implies and is 
implied by the fact that A is higher 
than B in the weakly ordered set 
mentioned above. (Some theories in- 
troduce probabilities into the above 
statement, so that if A is higher than 
B in the weak ordering, then eco- 
nomic man is more likely to choose A 
than B, but not certain to choose A.) 

This notion of maximization is 
mathematically useful, since it makes 
it possible for a theory to specify a 
unique point or a unique subset of 
points among those available to the 
decider. It seems to me psychologi- 
cally unobjectionable. So many differ- 
ent kinds of functions can be maxi- 
mized that almost any point actually 
available in an experimental situation 
can be regarded as a maximum of 
some sort. Assumptions about maxi- 
mization only become specific, and 
therefore possibly wrong, when they 
specify what is being maximized. 

There has, incidentally, been al- 
most no discussion of the possibility 
that the two parts of the concept of 
rationality might conflict. It is con- 
ceivable, for example, that it might 
be costly in effort (and therefore in 
negative utility) to maintain a weakly 
ordered preference field. Under such 
circumstances, would it be “rational” 
to have such a field? 

It is easy for a psychologist to point 
out that an economic man who has 
the properties discussed above is very 
unlike a real man. In fact, it is so 
easy to point this out that psycholo- 
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hand the theories that resi 
these assumptions. This ism 
Surely the assumptions conta 
Hullian behavior theory (91) o 
the Estes (60) or Bush-Mosi 
(36, 37) learning theories are no 
realistic than these. The most u 
thing to do with a theory is 
criticize its assumptions but 
to test its theorems. If the th 
fit the data, then the theory 
least heuristic merit. Of cou 
trivial theorem deducible 
assumptions embodied in the o 
of economic man is that 
specific case of choice these a 
tions will be satisfied. For jm 


men, then real men should 2 
exhibit transitivity of real choice 
Transitivity is an assumption, but 
is directly testable. So are the oth 
properties of economic man 
model for real men. 
Economists themselves are 
what distrustful of economi 


ber of attempts to relax these 
sumptions. : ^ 
Early utility maximization t 


started by Jeremy Bentham _ 
popularized by James Mill and ot 
held that the goal of human ac 
to seek pleasure and avoid 

Every object or action may be | 
sidered from the point of vi 
pleasure- or pain-giving prop 
These properties are called the # 
of the object, and pleasure is give 
by positive utility and pain by neg 
tive utility. The goal of action, 
is to seek the maximum utility. 
simple hedonism of the fut 
easily translated into a the 
choice. People choose the alt 
from among those open to the 


THEORY OF DECISION MAKING 


leads to the greatest excess of positive 
over negative utility. This notion of 
utility maximization is the essence of 
the utility theory of choice. It will 
reappear in various forms through- 
out this paper. (Bohnert [30] dis- 
cusses the logical structure of the 
utility concept.) 

This theory of choice was embodied 
in the formal economic analyses of all 
the early great names in economics. 
In the hands of Jevons, Walras, and 
Menger it reached increasingly so- 
phisticated mathematical expression 
and it was embodied in the thinking 
of Marshall, who published the first 
edition of his great Principles of 
Economics in 1890, and revised it at 
intervals for more than 30 years 
thereafter (137). 

The use to which utility theory was 
put by these theorists was to estab- 
lish the nature of the demand for 
Various goods. On the assumption 
that the utility of any good is a 
monotonically increasing negatively 
accelerated function of the amount of 
that good, it is easy to show that the 
amounts of most goods which a con- 
sumer will buy are decreasing func- 
tions of price, functions which are 
precisely specified once the shapes of 
the utility curves are known. This is 
the result the economists needed and 
1$, of course, a testable theorem. (For 
More on this, see 87, 159.) 

Complexities arise in this theory 
when the relations between the 
utilities of different goods are con- 
En Jevons, Walras, Menger, 
nd even Marshall had assumed that 
the utilities of different commodities 
can be combined into a total utility 
Y simple addition; this amounts to 
oming that the utilities of different 
the A are independent (in spite of 
à act that Marshall elsewhere dis- 

ssed the notions of competing 
goods, like soap and detergents, and 
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completing goods, like right and left 
shoes, which obviously do not have 
independent utilities). Edgeworth 
(53), who was concerned with such 
nonindependent utilities, pointed out 
that total utility was not necessarily 
an additive function of the utilities 
attributable to separate commodities. 
In the process he introduced the no- 
tion of indifference curves, and thus 
began the gradual destruction of the 
classical utility theory. We shall re- 
turn to this point shortly. 

Although the forces of parsimony 
have gradually resulted in the elimi- 
nation of the classical concept of 
utility from the economic theory of 
riskless choices, there have been a 
few attempts to use essentially the 
classical theory in an empirical way. 
Fisher (63) and Frisch (75) have de- 
veloped methods of measuring margi- 
nal utility (the change in utility [u] 
with an infinitesimal change in 
amount possessed [Q], i.e., du/dQ) 
from market data, by making assump- 
tions about the interpersonal simi- 
larity of consumer tastes. Recently 
Morgan (141) has used several vari- 
ants of these techniques, has dis- 
cussed mathematical and logical flaws 
in them, and has concluded on the 
basis of his empirical results that the 
techniques require too unrealistic 
assumptions to be workable. The 
crux of the problem is that, for these 
techniques to be useful, the com- 
modities used must be independent 
(rather than competing or complet- 
ing), and the broad commodity clas- 
sifications necessary for adequate 
market data are not independent. 
Samuelson (164) has shown that the 
assumption of independent utilities, 
while it does guarantee interval scale 
utility measures, puts unwarrantably 
severe restrictions on the nature of 
the resulting demand function. Else- 
where Samuelson (158) presented, 
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primarily as a logical and mathe- 
matical exercise, a method of measur- 
ing marginal utility by assuming 
some time-discount function. Since 
no reasonable grounds can be found 
for assuming one such function rather 
than another, this procedure holds no 
promise of empirical success. Mar- 
shall suggested (in his notion of “‘con- 
sumer's surplus") a method of utility 
measurement that turns out to be 
dependent on the assumption of con- 
stant marginal utility of money, and 
which is therefore quite unworkable. 
Marshall's prestige led to extensive 
discussion and debunking of this 
notion (e.g., 28), but little positive 
comes out of this literature. Thur- 
stone (186) is currently attempting 
to determine utility functions for 
commodities experimentally, but has 
reported no results as yet. 
Indifference curves. | Edgeworth's 
introduction of the notion of in- 
difference curves to deal with the 
utilities of nonindependent goods was 
mentioned above. An indifference 
curve is, in Edgeworth's formula- 
tion, a constant-utility curve. Sup- 
pose that we consider apples and 
bananas, and suppose that you get 
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+ 
the same amount of utility f 
10-apples-and-1-banana as 
from 6-apples-and-4-bananas, 
these are two points on an indif 
ence curve, and of course 


the same curve. Naturally, this 
the only indifference curve you 
have between apples and bananas. 
may also be true that you 
different between 13-apple: 
bananas and 5-apples-and-15 
as. These are two points on 
higher indifference curve. A 
family of such curves is called 
difference map. Figure 1 p 
such a map. One particularly 
kind of indifference map has 
of a commodity on one axi 
amounts of money on the 
Money is a commodity, too. 
The notion of an indifference 
can be derived, as Edgeworth dei 
it, from the. notion of measurab 


Pareto (146, see also 151) was ser 
ously concerned about the assul 
tion that utility was measurable 
to a linear transformation. He 
that people could tell whether 
preferred to be in state A or state 
but could not tell how much 
preferred one state over the other. 
other words, he hypothesized a util t, 
function measurable only on an o 
nal scale. Let us follow the 
economic language, and call uti 
measured on an ordinal scale ord 
utility, and utility measured on 
interval scale, cardinal utility. 
meaningless to speak of the slope, ' 
marginal utility, of an ordinal u 
function; such a function canno 
differentiated. However, Pareto 
that the same conclusions which 
been drawn from marginal u 
could be drawn from indif 
curves. An indifference map can 
drawn simply by finding all the c 
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binations of the goods involved 
among which the person is indiffer- 
ent. Pareto's formulation assumes 
that higher indifference curves have 
greater utility, but does not need to 
specify how much greater that utility 
is. 

It turns out to be possible to de- 
duce from indifference curves all of 
the theorems that were originally de- 
duced from cardinal utility measures. 
This banishing of cardinal utility was 
furthered considerably by splendid 
mathematical papers by Johnson 
(97) and Slutsky (177). (In modern 
economic theory, it is customary to 
think of an n-dimensional commodity 
space, and of indifference hyper- 
planes in that space, each such hyper- 
plane having, of course, » —1 dimen- 
sions. Inorder to avoid unsatisfactory 
preference structures, it is necessary 
to assume that consumers always 
have a complete weak ordering for all 
commodity bundles, or points in com- 
modity space. Georgescu-Roegen 
[76], Wold [204, 205, 206, 208], 
Houthakker [90], and Samuelson 
[167] have discussed this problem.) 

_ Pareto was not entirely consistent 
in his discussion of ordinal utility. 
Although he abandoned the assump- 
tion that its exact value could be 
known, he continued to talk about 
the sign of the marginal utility co- 
efficient, which assumed that some 
knowledge about the utility function 
other than purely ordinal knowledge 
was available. He also committed 
er inconsistencies. So Hicks and 
tipar (88), in 1934, were led to their 
A ssic paper in which they attempted 
i purge the theory of choice of its 
cu introspective elements. They 
peed the conventional economic 
1 ew about indifference curves as de- 
€rmined from a sort of imaginary 
doi Honnaire, and proceeded to de- 
Ive all of the usual conclusions about 
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consumer demand with no reference 
to the notion of even ordinal utility 
(though of course the notion of an 
ordinal scale of preferences was still 
embodied in their derivation of in- 
difference curves). This paper was 
for economics something like the be- 
haviorist revolution in psychology. 

Lange (116), stimulated by Hicks 
and Allen, pointed out another incon- 
sistency in Pareto. Pareto had as- 
sumed that if a person considered 
four states, A, B, C, and D, he could 
judge whether the difference between 
the utilities of A and B was greater 
than, equal to, or less than the differ- 
ence between the utilities of C and D. 
Lange pointed out that if such a 
comparison was possible for any A, 
B, C, and D, then utility was car- 
dinally measurable. Since it seems 
introspectively obvious that such 
comparisons can be made, this paper 
provoked a flood of protest and com- 
ment (7, 22, 117, 147, 209). Never- 
theless, in spite of all the comment, 
and even in spite of skepticism by a 
distinguished economist as late as 
1953 (153), Lange is surely right. 
Psychologists should know this at 
once; such comparisons are the basis 
of the psychophysical Method of 
Equal Sense Distances, from which 
an interval scale is derived. (Samuel- 
son [162] has pointed out a very in- 
teresting qualification. Not only 
must such judgments of difference be 
possible, but they must also be transi- 
tive in order to define an interval 
scale.) But since such judgments of 
differences did not seem to be neces- 
sary for the development of consumer 
demand theory, Lange’s paper did 
not force the reinstatement of cardi- 
nal utility. 

Indeed, the pendulum swung 
further in the behavioristic direction. 
Samuelson developed a new analytic 
foundation for the theory of con- 
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sumer behavior, the essence of which 
is that indifference curves and hence 
the entire structure of the theory of 
consumer choice can be derived 
simply from observation of choices 
among alternative groups of pur- 
chases available to a consumer (160, 
161) This approach has been ex- 
tensively developed by Samuelson 
(164, 165, 167, 169) and others (50, 
90, 125, 126). The essence of the idea 
is that each choice defines a point 
and a slope in commodity space. 
Mathematical approximation meth- 
ods make it possible to combine a 
whole family of such slopes into an 
indifference hyperplane. A family of 
such hyperplanes forms an indiffer- 
ence “map.” 

In a distinguished but inaccessible 
series of articles, Wold (204, 205, 206; 
see also 208 for a summary presenta- 
tion) has presented the mathematical 
content of the Pareto, Hicks and Al- 
len, and revealed preference (Samu- 
elson) approaches, as well as Cassel's 
demand function approach, and has 
shown that if the assumption about 
complete weak ordering of bundles of 
commodities which was discussed 
above is made, then all these ap- 
proaches are mathematically equiva- 
lent. 

Nostalgia for cardinal utility. The 
crucial reason for abandoning cardi- 
nal utility was the argument of the 
ordinalists that indifference curve 
analysis in its various forms could do 
everything that cardinal utility could 
do, with fewer assumptions. So far 
as the theory of riskless choice is con- 
cerned, this is so. But this is only an 
argument for parsimony, and parsi- 
mony is not always welcome. There 
was a series of people who, for one 
reason or another, wanted to rein- 
state cardinal utility, or at least 
marginal utility. There were several 
mathematically invalid attempts to 
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show that marginal utility could 
defined even in an ordinal-uti 
universe (23, 24, 163; 25, 1 
Knight (110), in 1944, argued @ 
tensively for cardinal utility; 
based his arguments in part on if 
trospective considerations and inp 
on an examination of psychophysica 
scaling procedures. He stimulated} 
number of replies (29, 42; 111). R 
cently Robertson (154) pleaded fc 
the reinstatement of cardinal utili 
in the interests of welfare economi 
(this point will be discussed agai 
below). But in general the indiff 
ence curve approach, in its vario 
forms, has firmly established itself 
the structure of the theory of ris 
choice. 

Experiments on indifference roe 
Attempts to measure marginal utility 
from market data were discuss 
above. There have been three expen 
mental attempts to measure indiffer 
ence curves. Schultz, who pioneered 
in deriving statistical demand cu 
interested his colleague at the Univer 
sity of Chicago, the psychol gis 
Thurstone, in the problem of m 
difference curves. Thurstone (185 
performed a very simple experiment 
He gave one subject a series of com 
binations of hats and overcoats, ame 
required the subject to judge whe he 
he preferred each combination to? 
standard. For instance, the subjec 
judged whether he preferred eight 
hats and eight overcoats to fifteet 
hats and three overcoats. The sam! 
procedure was repeated for hats 
shoes, and for shoes and overcoa 
The data were fitted with indiffe 
curves derived from the assumptions 
that utility curves fitted Fechner' 
Law and that the utilities of th 
various objects were independ 
Thurstone says that Fechner's Law 
fitted the data better than the otaa 
possible functions he considered, 
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ts no evidence for this asser- 
tion, The crux of the experiment 
was the attempt to predict the in- 
difference curves between shoes'and 
overcoats from the other indifference 
curves. This was done by using the 
other two indifference curves to infer 
utility functions for shoes and for 
overcoats separately, and then using 
these two utility functions to predict 
the total utility of various amounts 
of shoes and overcoats jointly. The 
prediction worked rather well. The 
judgments of the one subject used are 
extraordinarily orderly; there is very 
little of the inconsistency and vari- 
ability that others working in this 
area have found.  Thurstone says, 
"The subject... was entirely naive 
as regards the psychophysical prob- 
lem involved and had no knowledge 
whatever of the nature of the curves 
that we expected to find" (185, p. 
154). He adds, “I selected as subject 
a research assistant in my laboratory 
Who knew nothing about psycho- 
Physics. Her work was largely 
clerical in nature. She had a very 
even disposition, and I instructed her 
to take an even motivational attitude 
on the successive occasions . . . I was 
Surprised at the consistency of the 
judgments that I obtained, but I am 
pretty sure that they were the result 
: careful instruction to assume auni- 
orm motivational attitude."?* From 
€ economist's point of view, the 
m criticism of this experiment is 
at it involved imaginary rather 
an real transactions (200). 
E. es experimental measure- 
by E. indifference curves is reported 
(i57). ici Rousseas and Hart 
Be hey required large numbers 
de ents to rank sets of three com- 
ations of different amounts of ba- 


Li 
Thurstone, L. L. Personal communication, 
mber 7, 1953, 
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con and eggs. By assuming that all 
students had the same indifference 
curves, they wereable to derive a com- 
posite indifference map for bacon and 
eggs. No mathematical assumptions 
were necessary, and the indifference 
map is not given mathematical form. 
Some judgments were partly or com- 
pletely inconsistent with the final map, 
but not too many. The only conclu- 
sion which this experiment justifies is 
that it is possible to derive such a 
composite indifference map. 

The final attempt to measure an 
indifference curve is a very recent one 
by the psychologists Coombs and 
Milholland (49). The indifference 
curve-involved is one between risk 
and value of an object, and so will be 
discussed below in the section on the 
theory of risky decisions. It is men- 
tioned here because the same meth- 
ods (which show only that the in- 
difference curve is convex to the 
origin, and so perhaps should not be 
called measurement) could equally 
well be applied to the determination 


of indifference curves in riskless 
situations. 
Mention should be made of the 


extensive economic work on statisti- 
cal demand curves. For some reason 
the most distinguished statistical de- 
mand curve derivers feel it necessary 
to give an account of consumer's 
choice theory as à preliminary to the 
derivation of their empirical demand 
curves. The result is that the two 
best books in the area (172, 182) are 
each divided into two parts; the first 
is a general discussion of the theory 
of consumer's choice and the second 
a quite unrelated report of statistical 
economic work. Stigler (179) has 
given good reasons why the statistical 
demand curves are so little related to 
the demand curves of economic 
theory, and Wallis and Friedman 
(200) argue plausibly that this state 
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of affairs is inevitable. At any rate, 
there seems to be little prospect of 
using large-scale economic data to fill 
in the empirical content of the theory 
of individual decision making. 

Psychological comments. There are 
several commonplace observations 
that are likely to occur to psycholo- 
gists as soon as they try to apply the 
theory of riskless choices to actual 
experimental work. The first is that 
human beings are neither perfectly 
consistent nor perfectly sensitive. 
This means that indifference curves 
are likely to be observable as in- 
difference regions, or as probability 
distributions of choice around a 
central locus. It would be easy to 
assume that each indifference curve 
represents the modal value of a nor- 
mal sensitivity curve, and that choices 
should have statistical properties 
predictable from that hypothesis as 
the amounts of the commodities 
(locations in product space) are 
changed. This implies that the defi- 
nition of indifference between two 
collections of commodities should be 
that each collection is preferred over 
the other 50 per cent of the time. 
Such a definition has been proposed 
by an economist (108), and used in 
experimental work by psychologists 
(142). Of course, 50 per cent choice 
has been a standard psychological 
definition of indifference since the 
days of Fechner. 

Incidentally, failure on the part of 
an economist to understand that a 
just noticeable difference (j.n.d.) isa 
statistical concept has led him to 
argue that the indifference relation is 
intransitive, that is, that if A is in- 
different to B and B is indifferent to 
C, then A need not be indifferent to C 
(8, 9, 10). He argues that if 4 and B 
are less than one j.n.d. apart, then A 
will be indifferent to B; the same of 
course is true of B and C; but A and 
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C may be more than one j.n. 
and so one may be preferred. 
other. This argument is, of 1 
wrong. If A has slightly m 
than B, then the individu 
choose A in preference to B slig 
more than 50 per cent of the tii 
even though 4 and B are less 
one j.n.d. apart in utility. The 50 
cent point is in theory a p 
defined point, not a region. It 
fact be difficult to determine 
of inconsistencies in judg 
because of changes in taste with ti 

The second psychological obser 
tion is that it seems impossible € 
to dream of getting experi 
an indifference map in n-dimet 
space where z is greater than 3. 
the case of 1 —3 presents fo 
experimental problems. This 
important to the psychologist wht 
wants to use the theory of cho 
rationalize experimental data 
to the economist who wants to 
rive a theory of general static equ 
rium. 

Experiments like Thurstone's ( 
involve so many assumptions th 
is difficult to know what their e 
cal meaning might be if these as 
tions were not made. Presuma 
the best thing to do with such 
periments is to consider them as | 
of the assumption with the least. 
validity. Thurstone was will 
assume utility maximization a 
dependence of the commodities 
volved (incidentally, his choi 
commodities seems singularly 
fortunate for justifying an ass! 
tion of independent utilities), 4 
used his data to construct a utl 
function. Of course, if only orc 
utility is assumed, then experime 
indifference curves cannot be ! 
this way. In fact, in an 0 
utility universe neither of the 
cipal assumptions made by Th 
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ean be tested by means of experi- 
mental indifference curves, So the 
assumption of cardinal utility, though 
not necessary, seems to lead to con- 
siderably more specific uses for ex- 
perimental data. 

At any rate, from the experimental 
point of view the most interesting 
question is: What is the observed 
shape of indifference curves between 
independent commodities? This ques- 
tion awaits an experimental answer. 

The notion of utility is very similar 
to the Lewinian notion of valence 
(120, 121). Lewin conceives of 
valence as the attractiveness of an 
object or activity to a person (121). 
Thus, psychologists might consider 
the experimental study of utilities to 
be the experimental study of valences, 
and therefore an attempt at quantify- 
ing parts of the Lewinian theoretical 
schema. 


APPLICATION OF THE THEORY OF 
RiskLEss CHOICES TO WEL- 
FARE ECONOMICS 


The classical utility theorists as- 
sumed the existence of interpersonally 
comparable cardinal utility. They 
were thus able to find a simple an- 
Swer to the question of how to de- 
termine the best economic policy: 
That economic policy is best which 
results in the maximum total utility, 
Summed over all members of the 
economy. 

The abandonment of interpersonal 
Comparability makes this answer use- 
less. A sum is meaningless if the 
units being summed are of varying 
Sizes and there is no way of reducing 
them to some common size. This 


in ae discussion of welfare economics given 
\ this paper is exceedingly sketchy. For a 
e of what the complexities of modern 
i are economics are really like (see 11, 13, 
"wa He 124, 127, 139, 140, 148, 154, 155, 
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point has not been universally recog- 
nized, and certain economists (e.g., 
82, 154) still defend cardinal (but not 
interpersonally comparable) utility 
on grounds of its necessity for wel- 
fare economics. 

Pareto's principle. The abandon- 
ment of interpersonal comparability 
and then of cardinal utility produced 
a search for some other principle to 
justify economic policy. ^ Pareto 
(146), who first abandoned cardinal 
utility, provided a partial solution. 
He suggested that a change should 
be considered desirable if it left 
everyone at least as well off as he 
was before, and made at least one 
person better off. 

Compensation principle. Pareto’s 
principle is fine as far as it goes, but 
it obviously does not go very far. 
The economic decisions which can be 
made on so simple a principle are few 
and insignificant. So welfare eco- 
nomics languished until Kaldor (98) 
proposed the compensation prin- 
ciple. This principle is that if it is 
possible for those who gain from an 
economic change to compensate the 
losers for their losses and still have 
something left over from their gains, 
then the change is desirable. Of 
course, if the compensation is actually 
paid, then this is simply a case of 
Pareto's principle. 

But Kaldor asserted that the 
compensation need not actually be 
made; all that was necessary was 
that it could be made. The fact that 
it could be made, according to 
Kaldor, is evidence that the change 
produces an excess of good over harm, 
and so is desirable. Scitovsky (173) 
observed an inconsistency in Kaldor's 
position: Some cases could arise in 
which, when a change from A to B 
has been made because of Kaldor's 
criterion, then a change back from B 
to A would also satisfy Kaldor's 
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criterion. It is customary, therefore, 
to assume that changes which meet 
the original Kaldor criterion are only 
desirable if the reverse change does 
not also meet the Kaldor criterion. 

It has gradually become obvious 
that the Kaldor-Scitovsky criterion 
does not solve the problem of welfare 
economics (see e.g., 18, 99). It as- 
sumes that the unpaid compensation 
does as much good to the person who 
gains it as it would if it were paid to 
the people who lost by the change. 
For instance, suppose that an in- 
dustrialist can earn $10,000 a year 
more from his plant by using a new 
machine, but that the introduction of 
the machine throws two people ir- 
retrievably out of work. If the salary 
of each worker prior to the change 
was $4,000 a year, then the in- 
dustrialist could compensate the 
workers and still make a profit. But 
if he does not compensate the work- 
ers, then the added satisfaction he 
gets from his extra $10,000 may be 
much less than the misery he pro- 
duces in his two workers. This ex- 
ample only illustrates the principle; 
it does not make much sense in these 
days of progressive income taxes, un- 
employment compensation, high em- 
ployment, and strong unions. 

Social welfare functions. From here 
on the subject of welfare economics 
gets too complicated and too remote 
from psychology to merit extensive 
exploration in this paper. The line 
that it has taken is the assumption 
of a social welfare function (21), a 
function which combines individual 
utilities in a way which satisfies 
Pareto's principle but is otherwise 
undefined. In spite of its lack of 
definition, it is possible to draw 
certain conclusions from such a func- 
tion (see e.g., 164). However, Arrow 
(14) has recently shown that a social 
welfare function that meets certain 
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very reasonable requirements about 
being sensitive in some way to the 
wishes of all the people affected, 
etc., cannot in general be found in 
the absence of interpersonally com- 
parable utilities (see also 89). 
Psychological comment. Some econ- 
omists are willing to accept the 
fact that they are inexorably com- 
mitted to making moral judgments 
when they recommend economic 
policies (e.g., 152, 153). Others still 
long for the impersonal amorality of a 
utility measure (e.g., 154). However 
desirable interpersonally comparable 
cardinal utility may be, it seems 
utopian to hope that any experi- 
mental procedure will ever give in- 
formation about individual utilities 
that could be of any practical use in 
guiding large-scale economic policy. 


Tur Turony or Risky CHOICES 


Risk and uncertainty. Economists 
and statisticians distinguish between 


5 Strotz (183) and Alchian (1) present non- 
technical and sparkling expositions of the von 
Neumann and Morgenstern utility measure- 
ment proposals. Georgescu-Roegen (78) criti- 
cally discustes various axiom systems so as to 
bring some of the assumptions underlying this 
kind of cardinal utility into clear focus. Allais 
(3) reviews some of these ideas in the course 0 
criticizing them. Arrow (12, 14) reviews parts 
of the field. 

There is a large psychological literature om 
one kind of risky decision making, the kir 
which results when psychologists use partial 
reinforcement. This literature has been e 
viewed by Jenkins and Stanley (96). Recently 
a number of experimenters, including Jarrett 
(95), Flood (69, 70), Bilodeau (27), and my- 
self (56) have been performing experiments 0n 
human subjects who are required to choose 
repetitively between two or more alternatives 
each of which has a probability of pucr 
greater than zero and less than one. The pro p 
lems raised by these experiments are too ER 
plicated and too far removed from conven 
tional utility theory to be dealt with in 
paper. This line of experimentation may even- 
tually provide the link which ties together 
utility theory and reinforcement theory. 
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risk and uncertainty. There does not 
seem to be any general agreement 
about which concept should be as- 
sociated with which word, but the 
following definitions make the most 
important distinctions. 


Almost everyone would agree that . 


when I toss a coin the probability 
that I will get a head is .5. A proposi- 
tion about the future to which a num- 
ber can be attached, a number that 
represents the likelihood that the 
proposition is true, may be called a 
first-order risk. What the rules are for 
| attaching such numbers is a much 
debated question, which will. be 
avoided in this paper. 
Some propositions may depend on 
more than.one probability distribu- 
tion. For instance, I may decide that 
if I get a tail, I will put the coin back 
- in my pocket, whereas if I get a head, 
I will toss it again. Now, the prob- 
ability of the proposition “I will get 
à head on my second toss” is a func- 
tion of two probability distributions, 
the distribution corresponding to the 
first toss and that corresponding to 
- the second toss. This might be called 
_ &second-order risk. Similarly, risks of 
any order may be constructed. Itisa 
mathematical characteristic of all 
higher-order risks that they may be 
compounded into first-order risks by 
Means of the usual theorems for com- 
Pounding probabilities. (Some econo- 
E have argued against this pro- 
€dure [83], essentially on the grounds 

at you may have more information 


Y the time the second risk comes | 


oa: Such problems can best be 
E p by means of von Neumann 
^u orgenstern's [197] concept of 
ategy, which is discussed below. 
ep in general problems of 
ainty, rather than risk.) 

“tag Propositions about the future 
tl which no generally accepted 
abilities can be attached. What 
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is the probability that the following 
proposition is true: Immediately 
after finishing this paper, you will 
drink a glass of beer? Surely it is 
neither impossible nor certain, so it 
ought to have a probability between 
zero and one, but it is impossible for 
you or me to find out what that prob- 
ability might be, or even to set up 
generally acceptable rules about how 
to find out. Such propositions are 
considered cases of uncertainty, rather 
than of risk. This section deals only 
with the subject of first-order risks. 
The subject of uncertainty will arise 
again in connection with the theory 
of games. 

Expected utility maximization. The 
traditional mathematical notion for 
dealing with games of chance (and so 
with risky decisions) is the notion 
that choices should be made so as to 
maximize expected value. The ex- 
pected value of a bet is found by 
multiplying the value of each possible 
outcome by its probability of oc- 
currence and summing these prod- 
ucts across all possible outcomes. In 
symbols: 


EV=pisitpSet ++ > bos 


where p stands for probability, $ 
stands for the value of an outcome, 
and pitpat +++ band. 

The assumption that people ac- 
tually behave the way this mathe- 
matical notion says they should is 
contradicted by observable behavior 
in many risky situations. People are 
willing to buy insurance, even though 
the person who sells the insurance 
makes a profit. People are willing to 
buy lottery tickets, even though the 
lottery makes a profit. Consideration 
of the problem of insurance and of the 
St. Petersburg paradox led Daniel 
Bernoulli, an eighteenth century 
mathematician, to propose that they 
could be resolved by assuming that 
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people act so as to maximize expected 
utility, rather than expected value 
(26). (He also assumed that utility 
followed a function that more than a 
century later was proposed by Fech- 
ner for subjective magnitudes in 
general and is now called Fechner's 
Law.) This was the first use of the 
notion of expected utility. 

The literature on risky decision 
making prior to 1944 consists pri- 
marily of the St. Petersburg paradox 
and other gambling and probability 
literature in mathematics, some liter- 
ary discussion in economics (e.g., 109, 
187), one economic paper on lotteries 
(189), and the early literature of the 
theory of games (31, 32, 33, 34, 195), 
which did not use the notion of 
utility. The modern period in the 
study of risky decision making began 
with the publication in 1944 of von 
Neumann and Morgenstern's monu- 
mental book Theory of Games and 
Economic Behavior (196, see also 
197), which we will discuss more fully 
later. Von Neumann and Morgen- 
stern pointed out that the usual as- 
sumption that economic man can 
always say whether he prefers one 
state to another or is indifferent be- 
tween them needs only to be slightly 
modified in order to imply cardinal 
utility. The modification consists of 
adding that economic man can also 
completely order probability com- 
binations of states. Thus, suppose 
that an economic man is indifferent 
between the certainty of $7.00 and a 
50-50 chance of gaining $10.00 or 
nothing. We can assume that his 
indifference between these two pros- 
pects means that they have the same 
utility for him. We may define the 
utility of $0.00 as zero utiles (the 
usual name for the unit of utility, just 
as sone is the name for the unit of 
auditory loudness), and the utility 
of $10.00 as 10 utiles, These two 
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arbitrary definitions correspond 
defining the two undefined constanti 
which are permissible since card 
utility is measured only up to a linear 
transformation. Then we may cab 
culate the utility of $7.00 by : 
the concept of expected utility as fol: 
lows: > 


U($7.00) =.5 U($10.00) +.5 U($0.00) 
=.5(10)-+.5(0) =5. 


Thus we have determined the cardi- 
nal utility of $7.00 and found thatit 
is 5 utiles. By varyiig the probabi 
ities and by using the already fout 
utilities it is possible to discover 
utility of any other amount of mon 
using only the two permissible a 
trary definitions. It is even more 
convenient if instead of +$10.00, 
— $10.00 or some other loss is used a 
one of the arbitrary utilities. 

A variety of implications is em 
bodied in this apparently simple 
tion. In the attempt to examine ane 
exhibit clearly what these implica 
tions are, a number of axiom systems; 
differing from von Neumann an 
Morgenstern's but leading to t 
same result, have been develop » 
(73, 74, 85, 135, 136, 171). This 
paper will not attempt to go Int} 
the complex discussions (e.g. 190 
131, 168, 207) of these various à 
ternative axiom systems. One recen 
discussion of them (78) has com 
cluded, on reasonable grounds, thé 
the original von Neumann and 
genstern set of axioms is still the best 

It is profitable, however, to € 
amine what the meaning of this n 
tion is from the empirical point © 
view if it is right. First, it means e 
risky propositions can be order? 
in desirability, just as riskless on? 
can. Second, it means that the C 
cept of expected utility is beha o 
ally meaningful. Finally, it mea? 
that choices among risky alternativa 
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are made in such a way that they 
maximize expected utility. 

If this model is to be used to pre- 
dict actual choices, what could go 
wrong with it? It might be that the 
probabilities by which the utilities 
are multiplied should not be the ob- 
jective probabilities; in other words, a 
decider's estimate of the subjective 
importance of a probability may not 
be the same as the numerical value of 
that probability. It might be that 
the method of combination of proba- 
bilities and values should not be 
simple multiplication. It might be 
that the method of combination of 
the probability-value products should 
not be simple addition. It might be 
that the process of gambling has 
Some positive or negative utility of 
itsown. It might be that the whole 
approach is wrong, that people just 
do not behave as if they were trying 
to maximize expected utility. We 
shall examine some of these pos- 
sibilities in greater detail below. 

, Economic implications of maximiz- 
ing expected utility. The utility- 
measurement notions of von Neu- 
Mann and Morgenstern were en- 
thusiastically welcomed by many 
economists (e.g., 73, 193), though a 
few (e.g., 19) were at least tempo- 
Tarily (20) unconvinced. The most 
interesting economic use of them was 
Proposed by Friedman and Savage 
(73), who were concerned with the 
question of why the same person who 
Uys insurance (with a negative ex- 
Pected money value), and therefore is 
Wing to pay in order not to take 
isks, will also buy lottery tickets 
aw with a negative expected money 
id in which he pays in order to 
d risks, They suggested that these 
infi ie be reconciled by a doubly 
th cted utility curve for money, like 

at in Fig. 2. If I represents the 
Person's current income, then he is 
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clearly willing to accept "fair" in- 
surance (i.e., insurance with zero ex- 
pected money value) because the 
serious loss against which he is insur- 
ing would have a lower expected 
utility than the certain loss of the 
insurance premium. (Negatively ac- 
celerated total utility curves, like 
that from the origin to J, are what 
you get when marginal utility de- 
creases; thus, decreasing marginal 
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Fic. 2. HYPOTHETICAL UTILITY CURVE FOR 
MONEY, PROPOSED BY FRIEDMAN AND SAVAGE 


utility is consistent with the avoid- 
ance of risks.) The person would also 
be willing to buy lottery tickets, since 
the expected utility of the lottery 
ticket is greater than the certain loss 
of the cost of the ticket, because of 
the rapid increase in the height of the 
utility function. Other considera- 
tions make it necessary that the 
utility curve turn down again. Note 
that this discussion assumes that 
gambling has no inherent utility. 
Markowitz (132) suggested an im- 
portant modification in this hy- 
pothesis. He suggested that the 
origin of a person’s utility curve for 
money be taken as his customary 
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financial status, and that on both 
sides of the origin the curve be as- 
sumed first concave and then convex. 
If the person's customary state of 
wealth changes, then the shape of his 
utility curve will thus remain gen- 
erally the same with respect to where 
he now is, and so his risk-taking be- 
havior will remain pretty much the 
same instead of changing with every 
change of wealth as in the Friedman- 
Savage formulation. 

Criticism of the  expected-utility 
maximization theory. It is fairly easy 
to construct examples of behavior 
that violate the von Neumann- 
Morgenstern axioms (for a partic- 
ularly ingenious example, see 183). It 
is especially easy to do so when the 
amounts of money involved are very 
large, or when the probabilities or 
probability differences involved are 
extremely small. Allais (5) has con- 
structed a questionnaire full of items 
of this type. For an economist in- 
terested in using these axioms as a 
basis for a completely general theory 
of risky choice, these examples may 
be significant. But psychological in- 
terest in this model is more modest. 
The psychologically important ques- 
tion is: Can such a model be used to 
account for simple experimental ex- 
amples of risky decisions? 

Of course a utility function derived 
by von Neumann-Morgenstern means 
is not necessarily the same as a classi- 
P utility function (74, 203; see also 
82). 

Experiment on the von Neumann- 
Morgenstern model. A number of ex- 
periments on risky decision making 
have been performed. Only the first 
of them, by Mosteller and Nogee 
(142), has been in the simple frame- 
work of the model described above. 
All the rest have in some way or 
another centered on the concept of 
probabilities effective for behavior 
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-out the first experiment to apply the 


which differ in some way from the 
objective probabilities, as well as on 
utilities different from the objective 
values of the objects involved. 
Mosteller and Nogee (142) carried 


von Neumann-Morgenstern model. 
They presented Harvard undergradu- 
ates and National Guardsmen with 
bets stated in terms of rolls at poker 
dice, which each subject could accept 
or refuse. Each bet gave a "hand" 
at poker dice. If the subject could 
beat the hand, he won an amount 
stated. in the bet. If not, he lost a 
nickel. Subjects played with $1.00, 
which they were given at the be- 
ginning of each experimental session. 
They were run together in groups of 


five; but each decided and rolled the 


poker dice for himself. Subjects were 
provided with a table in which the 
mathematically fair bets were shown, 
so that a subject could immediately 
tell by referring to the table whether 
a given bet was fair, or better or 
worse than fair. 

In the data analysis, the first step 
was the determination of “indiffer- 
ence offers." For each probability 
used and for each player, the amount 
of money was found for which that 
player would accept the bet 50 per 
cent of the time. Thus equality was 
defined as 50 per cent choice, as it 
is likely to be in all psychological ex- 
periments of this sort. Then the 
utility of $0.00 was defined as 0 
utiles, and the utility of losing à 
nickel was defined as —1 utile. Wi 
these definitions and the probabilities 
involved, it was easy to calculate the 
utility corresponding to the amount 
of money involved in the indifference 
offer. It turned out that, in gener? 
the Harvard undergraduates T 
diminishing marginal utilities, wee 
the National Guardsmen ha 
creasing marginal utilities. 
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The utilities thus calculated were 
used in predicting the results of more 
complex bets. It is hard to evaluate 
the success of these predictions. At 
any rate, an auxiliary  paired- 
comparisons experiment showed that 
the hypothesis that subjects maxi- 
mized expected utility predicted 
choices better than the hypothesis 
that subjects maximized expected 
money value. 

The utility curve that Mosteller 
and Nogee derive is different from 
the one Friedman and Savage (73) 
were talking about. Suppose that a 
subject's utility curve were of the 
Friedman-Savage type, as in Fig. 2, 
and that he had enough money to put 
him at point P. If he now wins or 
loses a bet, then he is moved to a 
different location on the indifference 
curve, say Q. (Note that the amounts 
of money involved are much smaller 
than in the original Friedman-Savage 
use of this curve.) However, the con- 
struction of a Mosteller-Nogee utility 
curve assumes that the individual is 
always at the same point on his 
utility curve, namely the origin. This 
Means that the curve is really of the 
Markowitz (132) type discussed 
above, instead of the Friedman- 
Savage type. The curve is not really 
a curve of utility of money in general, 

ut rather it is a curve of the utility- 

or-n-more dollars. Even so, it must 
€ assumed further that as the total 
amount of money possessed by the 
Subject changes during the experi- 
Ment, the utility-for-z-more dollars 
Curve does not change. Mosteller and 
E argue, on the basis of detailed 
i mination of some of their data, 

at the amount of money possessed 
is the subjects did not seriously 
B their choices. The utility 
in es they reported showed chang- 
dd marginal utility within the 

ounts of money used in their ex- 
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periment. Consequently, their con- 
clusion that the amount of money 
possessed by the subjects was not 
seriously important can only be true 
if their utility curves are utility- 
for-n-more dollars curves and if the 
shapes of such curves are not affected 
by changes in the number of dollars 
on hand. This discussion exhibits a 
type of problem which must always 
arise in utility measurement and 
which is new in psychological scaling. 
The effects of previous judgments on 
present judgments are a familiar 
story in psychophysics, but they are 
usually assumed to be contaminating 
influences that can be minimized or 
eliminated by proper experimental 
design. In utility scaling, the funda- 
mental idea of a utility scale is such 
that the whole structure of a subject's 
choices should be altered as a result 
of each previous choice (if the choices 
are real ones involving money gains 
or losses). The Markowitz solution 
to this problem is the most practical 
one available at present, and that 
solution is not entirely satisfactory 
since all it does is to assume that 
people's utilities for money operate 
in such a way that the problem does 
not really exist. This assumption is 
plausible for money, but it gets 
rapidly less plausible when other 
commodities with a less continuous 
character are considered instead. 
Probability preferences. In a series 
of recent experiments (55, 57, 58, 
59), thewriterhasshown that subjects, 
when they bet, prefer some probabil- 
ities to others (57), and that these 
preferences cannot be accounted for 
by utility considerations (59). All 
the experiments were basically of the 
same design. Subjects were required 
to choose between pairs of bets ac- 
cording to the method of paired com- 
parisons. The bets were of three 
kinds: positive expected value, nega- 
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tive expected value, and zero ex- 
pected value. The two members of 
each pair of bets had the same ex- 
pected value, so that there was never 
(in the main experiment [57, 59]) any 
objective reason to expect that choos- 
ing one bet would be more desirable 
than choosing the other. 

Subjects made their choices under 
three conditions: just imagining they 
were betting; betting for worthless 
chips; and betting for real money. 
They paid any losses from their own 
funds, but they were run in extra 
sessions after the main experiment to 
bring their winnings up to $1.00 per 
hour. 

The results showed that two fac- 
tors were most important in deter- 
mining choices: general preferences or 
dislikes for risk-taking, and specific 
preferences among probabilities. An 
example of the first kind of factor is 
that subjects strongly preferred low 
probabilities of losing large amounts 
of money to high probabilities of 
losing small amounts of money—they 
just didn't like to lose. It also turned 
out that on positive expected value 
bets, they were more willing to accept 
long shots when playing for real 
money than when just imagining or 
playing for worthless chips. An ex- 
ample of the second kind of factor 
is that they consistently preferred 
bets involving a 4/8 probability of 
winning to all others, and consistently 
avoided bets involving a 6/8 prob- 
ability of winning. These preferences 
were reversed for negative expected 
value bets. 

These results were independent of 
the amounts of money involved in 
the bets, so long as the condition of 
constant expected value was main- 
tained (59). When pairs of bets which 
differed from one another in expected 
value were used, the choices were a 
compromise between maximizing ex- 
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pected amount of money and betting 
at the preferred probabilities (58). 
An attempt was made to construct 
individual utility curves adequate to 
account for the results of several sub- 
jects. For this purpose, the utility of 
$0.30 was defined as 30 utiles, and it 
was assumed that subjects cannot 
discriminate utility differences small- 
er than half a utile. Under these as- 
sumptions, no individual utility curves 
consistent with the data could be 
drawn. Various minor experiments 
showed that these results were relia- 
ble and not due to various possible 
artifacts (59). No attempt was made 
to generate a mathematical model of 
probability preferences. 

The existence of probability prefer- 
ences means that the simple von 
Neumann-Morgenstern method of 
utility measurement cannot succeed. 
Choices between bets will be deter- 
mined not only by the amounts of 
money involved, but also by the 
preferences the subjects have among 
the probabilities involved. Only an 
experimental procedure which holds 
one of these variables constant, oF 
otherwise allows for it, can hope to 
measure the other. Thus my expert 
ments cannot be regarded as a way 
of measuring probability preferences 
they show only that such preferences 
exist. à 

It may nevertheless be possible E 
get an interval scale of the utility 0 
money from gambling experiments by 
designing an experiment which meas- 
ures utility and probability prefer- 
ences simultaneously. Such expe 
ments are likely to be complicat 
and difficult to run, but they can 
designed. a 

Subjective probability. First, 2 
clarification of terms is necessary 
The phrase subjective probability ha 
been used in two ways: as 4 ant 
for a school of thought about th 
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| logical basis of mathematical prob- 


ability (51, 52, 80) and as a name for 
a transformation on the scale of 
mathematical probabilities which is 
somehow related to behavior. Only 
the latter usage is intended here. The 
clearest distinction between these 
two notions arises from considera- 
tion of what happens when an objec- 
tive probability can be defined (e.g., 
ina game of craps). If the subjective 
probability is assumed to be different 
from the objective probability, then 
the concept is being used in its sec- 
ond, or psychological, sense. Other 
terms with the same meaning have 
also been used: personal probability, 
psychological probability, expecta- 
tion (a poor term because of the 
danger of confusion with expected 
value). (For a more elaborate 
treatment of concepts in this area, 
see 192.) 

i In 1948, prior to the Mosteller and 
Nogee experiment, Preston and 
Baratta (149) used essentially similar 
logic and a somewhat similar experi- 
Ment to measure subjective prob- 
abilities instead of subjective values. 
They required subjects to bid com- 
petitively for the privilege of taking 
à bet. All bids were in play money, 
and the data consisted of the winning 
bids. If each winning bid can be con- 
sidered to represent a value of play 
Money such that the winning bidder 
Is indifferent between it and the bet 
he is bidding for, and if it is further 
ee that utilities are identical 
a the money value of the play 
oe "Ua and that all players have the 
a subjective probabilities, then 
a se data can be used to construct à 
Eae probability scale. Preston 

aratta constructed such a 

E The subjects, according to the 
Visi overestimate low probabilities 
a Anderestimate high ones, with an 
erence point (where subjective 
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equals objective probability) at about 
0.2. Griffith (81) found somewhat 
similar results in an analysis of 
parimutuel betting at race tracks, as 
did Attneave (17) in a guessing game, 
and Sprowls (178) in an analysis of 
various lotteries. The Mosteller and 
Nogee data (142) can, of course, be 
analyzed for subjective probabilities 
instead of subjective values. Mostel- 
ler and Nogee performed such an 
analysis and said that their results 
were in general agreement with 
Preston and Baratta’s. However, 
Mosteller and Nogee found no in- 
difference point for their Harvard 
students, whereas the National 
Guardsmen had an indifference point 
at about 0.5. They are not able to 
reconcile these differences in results. 

The notion of subjective probabil- 
ity has some serious logical difficulties. 
The scale of objective probability is 
bounded by 0 and 1. Should a sub- 
jective probability scale be similarly 
bounded, or not? If not, then many 
different subjective probabilities will 
correspond to the objective proba- 
bilities 0 and 1 (unless some trans- 
formation is used so that 0 and 1 ob- 
jective probabilities correspond to 
infinite subjective probabilities, which 
seems unlikely). Considerations of 
the addition theorem to be discussed 
in a moment have occasionally led 
people to think of a subjective 
probability scale bounded at 0 but 
not at 1. This is surely arbitrary. 
The concept of absolute certainty is 
neither more nor less indeterminate 
than is the concept of absolute im- 
possibility. 

Even more drastic logical problems 
arise in connection with the addition 
theorem. If the objective probability 
of event A is P, and that of A not 
occurring is Q, then P+Q=1. Should 
this rule hold for subjective proba- 
bilities? Intuitively it seems neces- 
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sary that if we know the subjective 
probability of A, we ought to be able 
to figure out the subjective proba- 
bility of not-A, and the only reason- 
able rule for figuring it out is sub- 
traction of the subjective probability 
of A from that of complete certainty. 
But the acceptance of this addition 
theorem for subjective probabilities 
plus the idea of bounded subjective 
probabilities means that the subjec- 
tive probability scale must be identi- 
cal with the objective probability 
scale. Only for a subjective proba- 
bility scale identical with the objec- 
tive probability scale will the 
subjective probabilities of a collec- 
tion of events, one of which must 
happen, add up to 1. In the special 
case where only two events, A and 
not-A, are considered, a subjective 
probability scale like S1 or $2 in 
Fig. 3 would meet the requirements 
of additivity, and this fact has led to 
some speculation about such scales, 
particularly about S1. But such 
scales do not meet the additivity re- 
quirements when more than two 
events are considered. 

One way of avoiding these diffi- 
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culties is to stop thinking about a 
scale of subjective probabilities and, 
instead, to think of a weighting 
function applied to the scale of objec- 
tive probabilities which weights these 
objective probabilities according to 
their ability to control behavior, Pre- 
sumably, I was studying this ability 
in my experiments on probability 
preferences (55, 57, 58, 59). "There is 
no reason why such weighted proba- 
bilities should add up to 1 or should 
obey any other simple combinatory 
principle. 

Views and experiments which com- 
bine utility and subjective probability. 
The philosopher Ramsey published 
in 1926 (reprinted in 150) an essay 
on the subjective foundations of the 
theory of probability; this contained 
an axiom system in which both utility 
and subjective probability appeared. 
He used 0.5 subjective probability as 
a reference point from which to de- 
termine utilities, and then used these 
utilities to determine other sub- 
jective probabilities. | Apparently, 
economists did not discover Ramsey's 
essay until after von Neumann and 
Morgenstern’s book aroused interest 
in the subject. The only other formal 
axiom system in which both utility 
and subjective probability play 4 
part is one proposed by Savage 
(171), which is concerned with un- 
certainty, rather than risk, and uses 
the concept of subjective probability 
in its theory-of-probability sense. 

The most extensive and important 
experimental work in the whole field 
of decision making under risk and 
uncertainty is now being carried ou 
by Coombs and his associates at the 
University of Michigan. Coombs$ 
thinking about utility and subjective 
probability is an outgrowth of his 
thinking about psychological scaling 
in general. (For a discussion 0 


views, see 43, 44, 45, 46, 47.) The 
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essence of his work is the attempt to 
measure both utility and subjective 
probability on an ordered metric 
scale. An ordered metric scale has all 
the properties of an ordinal scale, 
and, in addition, the distances be- 
tween some or all of the stimuli can 
be rank ordered. Coombs has de- 
veloped various experimental pro- 
cedures for obtaining such informa- 
tion about the spacings of stimuli. 
In the most important article on 
utility and subjective probability to 
come out of the Coombs approach, 
Coombs and Beardslee (48) present 
an analysis of gambling decisions in- 
volving three independent variables: 
utility for prize, utility for stake, and 
subjective probability. All three are 
assumed measurable only up to an 
ordered metric, although it is as- 
sumed that the psychological prob- 
ability of losing the stake is one minus 
the .psychological probability of 
winning the prize, an assumption that 
limits the permissible underlying 
Psychological probability functions 
to shapes like those in Fig. 3. An 
elaborate graphic analysis of the in- 
difference surfaces in this three- 
dimensional space is given, contain- 
ing far too many interesting relation- 
ships to summarize here. An ex- 
periment based on this model was de- 
Signed. Coombs is reluctant to use 
Sums of money as the valuable ob- 
nr in his experiments because of 
nisal that subjects will respond 
X € numerical value of the amount 
dollars rather than to the psycho- 
E value. Therefore he used 
Nus desirable objects (eg. a 
io as stimuli, and measured their 
in Mss the techniques he has de- 
ine to obtain „ordered metric 
oni He used simple numerical 
de of probability as the 
Sub; ability stimuli, and assumed that 
Jective probability was equal to 
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objective probability. The subject 
from whose judgments the ordered 
metric utility measurement was con- 
structed was then presented with 
imaginary bets involving these ob- 
jects and probabilities, and it turned 
out that she almost always chose the 
one with the higher expected utility. 
This experiment is significant only 
as an illustration of the application 
of the method; the conclusion that 
subjects attempt to maximize ex- 
pected utility cannot very comfort- 
ably be generalized to other subjects 
and to real choices without better 
evidence. 

Coombs and Milholland (49) did a 
much more elaborate experiment in 
which they established ordered metric 
scales, both for the utilities of a col- 
lection of objects and for the subjec- 
tive probabilities of a collection of 
statements (e.g., Robin Roberts will 
win 20 games next year). Statements 
and objects were combined into 
“bets,” and the two subjects for 
whom the ordered metric scales had 
been established were asked to make 
judgments about which bet they 
would most, and which they would 
least, prefer from among various 
triads of bets. These judgments were 
examined to discover whether or not 
they demonstrated the existence of 
at least one convex indifference curve 
between utility and subjective prob- 
ability (the requirements for demon- 
strating the convexity of an in- 
difference curve by means of ordered 
metric judgments are fairly easy to 
state). A number of cases consistent 
with a convex indifference curve were 
found, but a retest of the ordered 
metric data revealed changes which 
eliminated all of the cases consistent 
with a convex indifference curve for 
one subject, and all but one case for 
the other. It is not possible to make 
a statistical test of whether or not 
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that one case might have come about 
by chance. No evidence was found 
for the existence of concave indiffer- 
ence curves, which are certainly in- 
consistent with the theory of risky 
decisions. This experiment is a fine 
example of the strength and weak- 
ness of the Coombs approach. It 
makes almost no assumptions, takes 
very little for granted, and avoids 
the concept of error of judgment; as 
a result, much of the potential in- 
formation in the data is unused and 
rarely can any strong conclusions be 
drawn. 

A most disturbing possibility is 
raised by experiments by Marks (133) 
and Irwin (94) which suggest that the 
shape of the subjective probability 
function is influenced by the utilities 
involved in the bets. If utilities and 
subjective probabilities are not inde- 
pendent, then there is no hope of pre- 
dicting risky decisions unless their 
law of combination is known, and it 
seems very difficult to design an ex- 
periment to discover that law of com- 
bination. However, the main dif- 
ferences that Marks and Irwin found 
were between probabilities attached 
to desirable and undesirable alterna- 
tives. It is perfectly possible that 
there is one subjective probability 
function for bets with positive ex- 
pected values and a different one for 
bets with negative expected values, 
just as the negative branch of the 
Markowitz utility function is likely 
to be different from the positive 
branch. The results of my probabil- 
ity preference experiments showed 
very great differences between the 
probability preference patterns for 
positive and for negative expected- 
value bets (57), but little difference 
between probability preferences at 
different expected-value levels so 
long as zero expected value was not 
crossed (59). This evidence supports 
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the idea that perhaps only two sub- 
jective probability functions are nec- 
essary. 

Santa Monica Seminar. In the 
summer of 1952 at Santa Monica, 
California, a group of scientists con- 
ferred on problems of decision mak- 
ing. They met in a two-month semi- 
nar sponsored by the University of 
Michigan and the Office of Naval 
Research. The dittoed reports of 
these meetings are a gold mine of 
ideas for the student of this problem. 
Some of the work done at this semi- 
nar is now being prepared for a book 
on Decision Processes edited by R. M. 
Thrall, C. H. Coombs, and R. E 
Davis, of the University of Michigan. 

Several minor exploratory experi- 
ments were done at this seminar. 
Vail (190) did an experiment in which 
he gave four children the choice of 
which side of various bets they 
wanted to be on. On the assumption 
of linear utilities, he was able to com- 
pute subjective probabilities for these 
children. The same children, how- 
ever, were used as subjects for a 
number of other experiments; 50, 
when Vail later tried them out on 
some other bets, he found that they 
consistently chose the bet with the 
highest probability of winning, Ie 
gardless of the amounts of money 1M- 
volved. When 50-50 bets were in 
volved, one subject consistently chose 
the bet with the lowest expect 
value. No generalizable conclusions 
can be drawn from these experiments. 

Kaplan and Radner (100) tried out 
a questionnaire somewhat ^ e 
Coombs's method of measuring sub- 
jective probability. Subjects wet 
asked to assign numbers to various 
statements. The numbers coul 
anything from 0 to 100 and were v 
represent the likelihood that the 
statement was true. The hypotheses 
to be tested were: (a) for sets of ex- 


we and mutually exclusive 
nts in which the numbers as- 
(estimates of degree of belief) 
nearly equal, the sums of these 
bers over a set would increase 
the number of alternatives (be- 
c low probabilities would be over- 
estimated) ; (^) for sets with the same 
numbers of alternatives, those with 
one high number assigned would have 
a lower set sum than those with no 
igh numbers. The first prediction 
verified; the second was not. 
y judgments of this sort are so 
m more likely to be made on the 
basis of number preferences and 
ilar variables than on subjective 
bilities that they offer very 
Tittle hope as a method of measuring 
subjective probabilities. 

- Variance preferences. Allais (2, 3, 
4) and Georgescu-Roegen (78) have 
argued that it is not enough to apply 
"a transform on objective value and on 
objective probability in order to pre- 
dict risky decisions from expected 
“utility (see also 188); it is also neces- 
Sary to take into account at least the 
variance, and possibly the higher 
moments, of the utility distribution. 
"here are instances in which this 
argument seems convincing. You 
would probably prefer the certainty 
of a million dollars to a 50-50 chance 
Of getting either four million or noth- 
ing. I do not think that this prefer- 
ence is due to the fact that the ex- 
pected utility of the 50-50 bet is less 
than the utility of one million dollars 
to you, although this is possible. A 
more likely explanation is simply 
Mat the variances of the two propo- 
sitions are different. Evidence in 
favor of this is the fact that if you 
Knew you would be offered this choice 
_ 40 times in succession, you would 
Probably take the 50-50 bet each 
time. Allais (5) has constructed a 
Number of more sophisticated exam- 


THEORY OF DECISION MAKING 


401 


ples of this type. However, from a 
simple-minded psychological point of 
view, these examples are irrelevant. 
It is enough if the theory of choice 
can predict choices involving familiar 
amounts of money and familiar 
probability differences—choices such 
as those which people are accustomed 
to making. It may be necessary for 
economic theory that the theory of 
choice be universal and exceptionless, 
but experimental psychologists need 
not be so ambitious. This is fortu- 
nate, because the introduction of the 
variance and higher moments of the 
utility distribution makes the prob- 
lem of applying the theory experi- 
mentally seem totally insoluble. It is 
difficult enough to derive reasonable 
methods of measuring utility alone 
from risky choices; when it also be- 
comes necessary to measure subjec- 
tive probability and to take the 
higher moments of the utility dis- 
tribution into account, the problem 
seems hopeless. Allais apparently 
hopes to defeat this problem by using 
psychophysical methods to measure 
utility (and presumably subjective 
probability also). This is essentially 
what Coombs has done, but Coombs 
has recognized that such procedures 
are unlikely to yield satisfactory 
interval scales. The dollar scale of 
the value of money is so thoroughly 
taught to us that it seems almost im- 
possible to devise a psychophysical 
situation in which subjects would 
judge the utility, rather than the dol- 
lar value, of dollars. They might 
judge the utility of other valuable 
objects, but since dollars are the 
usual measure of value, such judg- 
ments would be less useful, and even 
these judgments would be likely to be 
contaminated by the dollar values of 
the objects. I would get more utility 
from a new electric shaver than I 
would from a new washing machine, 
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but because of my knowledge of the 
relative money values of these ob- 
jects, I would certainly choose the 
washing machine if given a choice 
between them. Somewhat similar 
arguments can be applied against 
using psychophysical methods to 
measure subjective probability. A 
final point is that, since these subjec- 
tive scales are to be used to predict 
choices, it would be best if they could 
be derived from similar choices. 

Other approaches. Shackle (175) 
has proposed a theory of decision 
making under risk and uncertainty. 
This theory is unique in that it does 
not assume any kind of maximizing 
behavior. For every possible out- 
come of a decision made in a risky or 
uncertain situation, Shackle assumes 
that there is a degree of potential 
surprise that this, rather than some 
other, outcome would occur. Every 
outcome-potential surprise pair is 
ranked in accordance with its ability 
to stimulate the mind (stimulation in- 
creases with increasing outcome and 
decreases with increasing potential 
surprise). The highest-ranking posi- 
tive outcome-potential surprise pair 
and the highest-ranking negative pair 
are found, and these two possibilities 
alone determine what the individual 
will do. Semi-mathematical methods 
are used to predict the outcome of 
consideration of possible lines of ac- 
tion. Although attempts have been 
made to relate it to Wald's minimax 
principle for statistical decision func- 
tions (see below), the fact remains 
that most critics of the Shackle point 
of view have judged it to be either too 
vague to be useful, or, if specified in 
detail, too conducive to patently ab- 
surd predictions (e.g., 201). 

Shackle's point of view was de- 
veloped primarily to deal with unique 
choices—choices which can be made 
only once. Allais (3) has similarly 
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criticized conventional utility 
ory's attack on this problem. i 
the usual frequency theory o 
ability conceives of the probab 
the limit of the outcomes of 
number of similar trials, it i$ 
tionable that notions which use 
ability in the ordinary sense 
notion of maximizing expected 
ity) are applicable to unique 
However, this seems to be an'& 
mental problem. If notions w 
ordinary probability are inca) 
predicting actual unique 
then it will be necessary to seek 
theoretical tools. But so long 
generally acceptable probabili 
be defined (e.g., as in the uni 
of a coin), it is not necessary 
sume a priori that theories bi 
conventional probabilities will 
adequate. When no generall 
ceptable probability can be d d, 


ferent. 

Cartwright and Festinger (38 
have proposed a theory about 
time it takes to make decisions 
is in some nae similar to those: 


ference is that CRAS add the 
of restraining forces, and that 


as fluctuating randomly aro 
mean value. From this they di 


which subjects will feel about | 
decisions, and apparently these 
ositions work out FW 


Lewinian theoretical " orient 
seems to lead to this kind of 
Lewin, Dembo, Beating: an 


course, the notion of utility 
similar to the Lewinian noti 
valence. 
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Landahl (115) has presented a 
mathematical model for risk-taking 
behavior based on the conceptual 
neurology of the mathematical bio- 
physics school. 

Psychological comments. The area 
of risky decision making is full of 
fascinating experimental problems. 
Of these, the development of a satis- 
factory scale of utility of money and 
of subjective probability must come 
first, since the theory of risky de- 
cision making is based on these no- 
tions. The criterion for satisfactori- 
ness of these scales must be that they 
successfully predict choices other 
than those from which they were de- 
rived. To be really satisfactory, it is 
desirable that they should predict 
choices in a wide variety of differing 
situations. Unlike the subjective 
scales usually found in psychophys- 
ics, it is likely that these scales will 
differ widely from person to person, 
so a new determination of each scale 
must be made for each new subject. 
It can only be hoped that the scales 
do not change in time to any serious 
degree; if they do, then they are 
useless, 

, Once scales of utility and subjec- 
tive probability are available, then 
many interesting questions arise. 
What about the addition theorem for 
subjective probabilities? Does gam- 
bling itself have utility, and how 
much? To what extent can these sub- 
Jective scales be changed by learning? 
To what degree do people differ, and 
can these differences be correlated 
With environmental, historical, or 
personality differences? Finally, psy- 
chologists might be able to shed light 
on the complex economic problem of 
interacting utilities of different goods. 
like x area of risky decision making, 
hd e area of the theory of games, 
a S to encourage in those inter- 

ed in it the custom of carrying out 
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small pilot experiments on their sons, 
laboratory assistants, or secretaries. 
Such experiments are too seldom 
adequately controlled, and are al- 
most never used as a basis for larger- 
scale, well-designed experiments. 
Whether an ill-designed and hap- 
hazardly executed little experiment is 
better than no experiment at all is 
questionable. The results of such 
pilot experiments too often are picked 
up and written into the literature 
without adequate warning about the 
conditions under which they were 
performed and the consequent limita- 
tions on the significance of the results. 


THE TRANSITIVITY OF CHOICES 


In the section on riskless choices 
this paper presented a definition of 
economic man. The most important 
part of this definition can be summed 
up by saying that economic man is 
rational. The concept of rationality 
involves two parts: that of a weak 
ordering of preferences, and that of 
choosing so as to maximize some- 
thing. Of these concepts, the one 
which seems most dubious is the one 
of a weakly ordered preference field. 
This is dubious because it implies 
that choices are transitive; that is, if 
A is preferred to B, and Bis preferred 
to C, then A is preferred to C. 

Two economists have designed ex- 
periments specifically intended to 
test the transitivity of choices. Pap- 
andreou performed an elaborate and 
splendidly controlled experiment 
(145) designed to discover whether or 
not intransitivities occurred in im- 
agined-choice situations. He pre- 

triplets of hypothetical bun- 
dles of admissions to plays, athletic 
contests, concerts, etc., and required 
his subjects to choose between pairs 
of bundles. Each bundle consisted of 
a total of four admissions to two 
events, e.g., 3 plays and 1 tennis 
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tournament. In the main experi- 
ment, each bundle is compared with 
two others involving the same kinds 
of events, but in the better designed 
auxiliary experiment, a total of six 
different events are used, so that each 
bundle has no events in common with 
the other two bundles in its triplet. 
Since there are three bundles in each 
triplet, there are three choices be- 
tween pairs for each triplet, and 
these choices may, or may not, be 
transitive. The subjects were per- 
mitted to say that they were indiffer- 
ent between two bundles; conse- 
quently there were 27 possible con- 
figurations of choices, of which only 
13 satisfied the transitivity axiom. 
In the main experiment, 5 per cent 
of the triplets of judgments were 
intransitive; in the auxiliary experi- 
ment, only 4 per cent. Papandreou 
develops a stochastic model for 
choices under such conditions; the 
results are certaihly consistent with 
the amount of intransitivity per- 
mitted by his model. Papandreou 
concludes that at least for his specific 
experimental conditions, transitivity 
does exist. 

May (138), using different kinds of 
stimuli in a less elaborate experiment, 
comes up with results less consistent 
with transitivity. May required a 
classroom group to make pairwise 
choices between three marriage part- 
ners who were identified only by 
saying how intelligent, good looking, 
and rich they were. Judgments of 
indifference were not permitted. The 
results were that 27 per cent of the 
subjects gave intransitive triads of 
choices. May suggests, very plausi- 
bly, that intransitive choices may be 
expected to occur whenever more 
than one dimension exists in the 
stimuli along which subjects may 
order their preferences. However, 
May would probably have gotten 
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fewer intransitivities if he had per- 
mitted the indifference judgment, If 
subjects are really indifferent among 
all three of the elements of a triad of 
objects, but are required to choose 
between them in pairs and do so by 
chance, then they will choose in- 
transitively one-fourth of the time. 
Papandreou’s stochastic model gives 
one theory about what happens 
when preferences diverge just slightly 
from indifference, but presumably a 
more detailed model can be worked 
out.  Papandreou's model permits 
only three states: prefer A to B, 
prefer B to A, and indifferent. It 
ought to be possible to base a model 
for such situations on the cumulative 
normal curve, and thus to permit any 
degree of preference. For every com- 
bination of degrces of preference, 
such a model would predict the fre- 
quency of intransitive choices. 

In the paired comparisons among 
bets (57) described in the section on 
risky choices, quite elaborate 1n- 
transitivities could and did occur. 
However, it is easy to show that any 
intransitivity involving four or more 
objects in a paired comparisons 
judgment situation will necessarily 
produce at least one intransitivity 10- 
volving three objects. Consequently, 
the intransitive triplet or circular 
triad is the best unit of analysis for 
intransitivities in these more com- 
plicated judgment situations. 
counted the frequency of occurrence 
of circular triads and found that they 
regularly occurred about 20 per cent 
of the total number of times they 
could occur. (Of course, no indiffer- 
ence judgments could be permitt i 
The experiment fulfills May's C" 
terion for the occurrence of intrans; 
tivities, since both probability an 
amount of money were present 1M 
each bet, and subjects could be si 
pected to take both into accoun 
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| when making, choices. It might be 
supposed that the difference between 
the imaginary choices of the Papan- 
| dreou and May experiments and the 
real choices in my experiment would 
lead to differences in the frequency of 
occurrence of intransitivities, but 
there were no substantial differences 
in my experiment between the fre- 
quencies of occurrence in the just- 
imagining sessions and in the real 
gambling sessions, and what differ- 
ences there were, were in the direction 
of greater transitivity when really 
gambling. These facts should facili- 
tate further experiments on this prob- 

lem. 

In one sense, transitivity can never 
be violated. A minimum of three 
choices is required to demonstrate 

 intransitivity. Since these choices 
will necessarily be made in sequence, 
it can always be argued that the per- 
son may have changed his tastes be- 
tween the first choice and the third. 
However, unless the assumption of 
constancy of tastes over the period of 
experimentation is made, no experi- 
ments on choice can ever be mean- 
ingful, and the whole theory of choice 
becomes empty (see 184 for a similar 
situation). So this quibble can be re- 
Jected at once. 

| Utility maximization will not work 
except with a transitive preference 
field. Consequently, if the models 
discussed in this paper are to predict 
experimental data, it is necessary 
that intransitivities in these data be 
infrequent enough to be considered 

. 48 errors. However, from a slightly 
different point of view (54) the occur- 
rence or nonoccurrence of transitive 
choice patterns is an experimental 
phenomenon, and presumably a law- 
ul one. May has suggested what 
that law is: Intransitivities occur 
When there are conflicting stimülus 
dimensions along which to judge. 
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This notion could certainly be tested 
and made more specific by appropri- 
ate experiments. 

A final contribution in a related, 
but different, area is Vail's stochastic 
utility model (191). Vail assumes 
that choices are dependent on utili- 
ties that oscillate in a random man- 
ner around a mean value. From this 
assumption plus a few other reason- 
able ones, he deduces that if the 
over-all preference is 1227 3, and if 
1 is preferred to 2 more than 2 is 
preferred to 3, then the frequencies of 
occurrence of the six possible transi- 
tive orderings should be ordered as 
follows: 123>132>213>312>231 
7321. This result is certainly easy 
to test experimentally, and sounds 
plausible. 


THE THEORY OF GAMES AND OF 
DECISION FUNCTIONS? 


This section will not go into the 
theory of games or into the intimately 
related subject of statistical decision 
functions at all thoroughly. These 
are mathematical subjects of a highly 


6 Marschak (134), Hurwicz (92), Neisser 
(143), Stone (181), and Kaysen (107) pub- 


lished reviews of The Theory of Games and 


intended as a textbook, which is 
simpler than von Neumann and Morgenstern 
sad puivdes certain topics further. Wald's 


198) is, of course, the classical work on 
dd Bross's book 
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technical sort, with few statements 
which lend themselves to experimen- 
tal test. Rather, the purpose of this 
section is to show how these subjects 
relate to what has gone before, to give 
a brief summary of the contents of 
Theory of Games and Economic Be- 
havior by von Neumann and Morgen- 
stern (197), and to describe a few ex- 
periments in the area of game playing 
—experiments which are stimulated 
by the theory of games although not 
directly relevant to it. 

The theory of games. The theory of 
games probably originated in the 
work of Borel (31, 32, 33, 34; see also 
71, 72) in the 1920’s. In 1928, von 
Neumann (195), working independ- 
ently of Borel, published the first 
proof of the fundamental theorem in 
the theory, a theorem that Borel had 
not believed to be generally true. 
However, the subject did not become 
important until 1944, when von 
Neumann and Morgenstern pub- 
lished their epoch-making book (196). 
(A second edition, with an appendix 
on cardinal utility measurement, 
came out in 1947 [197].) Their pur- 
pose was to analyze mathematically a 
very general class of problems, which 
might be called problems of strategy. 
Consider a game of tic-tac-toe. You 
know at any moment in the game 
what the moves available to your op- 
ponent are, but you do not know 
which one he will choose. The only 
information you have is that his 
choice will not, in general, be com- 
pletely random; he will make a move 
which is designed in some way to in- 
crease his chance of winning and di- 
minish yours. Thus the situation is 
one of uncertainty rather than risk, 
Your goals are similar to your op- 
ponent’s. Your problem is: what 
strategy should you adopt? The 
theory of games offers no practical 
help in developing strategies, but it 
does offer rules about how to choose 
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among them. In the case of tic-tac 
toe, these rules are trivial, since 
either player can force a draw, But 
in more complicated games of strat- 
egy, these rules may be useful, Ip 
particular, the theory of games may 
be helpful in analyzing proper strat- 
egy in games having random ele 
ments, like the shuffling of cards, or 
the throwing of dice. It should be 
noted that the concept of a game isan 
exceedingly general concept. A sciens 
tist in his laboratory may be con- 
sidered to be playing a game against 
Nature. (Note, however, that we 
cannot expect Nature to try to defeat 
the scientist.) Negotiators in a labor 
dispute are playing a game against 
one another. Any situation in which 
money (or some valuable equivalent) 
may be gained as the result of à 
proper choice of strategy can be con- 
sidered as a game. 

To talk about game theory, a few 
technical terms are necessary. 
strategy is a set of personal rules for 
playing the game. For each possible 
first move on your part, your op- 
ponent will have a possible set of re 
sponses. For each possible response 
by your opponent, you will have a set 
of responses, and so on through the 
game. A strategy is a list which spect 
fies what your move will be for every 
conceivable previous set of moves 0 
the particular game you are playing 
Needless to say, only for the simplest 
games (e.g., matching pennies) does 
this concept of strategy have any 
empirical meaning. N 

Associated with strategies are Mk 
putations. An imputation is a set 0! 
payments made as a result of a game 
one to each player. In general, differ- 
ent imputations will be associat 
with different sets of strategies, but 
for any given set of strategies the 
may be more than one imputation 
(in games involving coalitions). . 

Imputation X is said to dominalé 


putation Y if one or more of the 
yers has separately greater gains 
smaller losses) in X than in Y and 
, by acting together (in the case of 
more than one player), enforce the 
eecurrence of X, or of some other im- 
putation at least as good. The rela- 
fionship of domination is not transi- 
tive. 
A solution is a set of imputations, 
mone of which dominates another, 
such that every imputation outside 
solution is dominated by at least 
one imputation within the solution. 
on Neumann and Morgenstern as- 
sert that the task of the theory of 
games is to find solutions. For any 
game, there may be one or more than 
one. One bad feature of the theory of 
games is that it frequently gives a 
large, or even infinite, number of solu- 
tions for a game. 

The above definitions make clear 
that the only determiner of behavior 
in games, according to this theory, is 
the amounts of money which may be 
Won or lost, or the expected amounts 
games with random elements. The 
fun of playing, if any, is irrelevant. 

The minimax loss principle. The 
notions of domination and of solution 
"imply a new fundamental rule for 

decision making—a rule sharply dif- 
ferent from the rule of maximizing 
utility or expected utility with which 
this paper has been concerned up to 
1 this section. This rule is the rule of 
minimizing the maximum loss, or, 
more briefly, minimax loss. In other 
Words, the rule is to consider, for each 
Possible strategy that you could 
- adopt, what the worst possible out- 
come is, and then to select that strat- 
€gy which would have the least ill- 
Es if the worst possible outcome 
appened. Another way of putting 
the same idea is to call it the principle 
of maximizing the minimum gain, or 
E gain. This rule makes con- 
iderable sense in two-person games 
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when you consider that the other 
player is out to get you, and so will 
do his best to make the worst possible 
outcome for you occur, If this rule is 
expressed geometrically, it asserts 
that the point you should seck is a 
saddle-point, like the highest point in 
a mountain pass (the best rule for 
crossing mountains is to minimize 
maximum height, so explorers seck 
out such saddle-points). 

Before we go any further, we need 
a few more definitions. Games may 
be among any number of players, but 
the simplest game is a two-person 
game, and it is this kind of game 
which has been most extensively and 
most successfully analyzed. Funda- 
mentally, two kinds of payoff ar- 
rangements are possible, The sim- 
plest and most common is the one in 
which one player wins what the other 
player loses, or, more generally, the 
one for which the sum of all the pay- 
ments made as a result of the game is 
zero. This is called a sero-sum game. 
In monzero-sum games, analytical 
complexities arise. These can be di- 
minished by assuming the existence 
of a fictitious extra player, who wins 
or loses enough to bring the sum of 
payments back to zero. Such a ficti- 
tious player cannot be assumed to 
have a strategy and cannot, of course, 
interact with any of the other players. 

In zero-sum two-person games, 
what will happen? Each player, ac- 
cording to the theory, should pick his 
minimax strategy. But will this re- 
sult in a stable solution? Not always. 
Sometimes the surface representing 
the possible outcomes of the game 
does not have a saddle-point. In this 
case, if player A chooses his minimax 
strategy, then player B will have an 
incentive not to use his own minimax 
strategy, because having found out 
his opponent's strategy, he can gain 
more by some other strategy. Thus 
the game has no solution. 
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Various resolutions of this problem 
are possible. Von Neumann and 
Morgenstern chose to introduce the 
notion of a mixed strategy, which is a 
probability distribution of two or 
more pure strategies. The fundamen- 
tal theorem of the theory of games is 
that if both players in a zero-sum 
two-person game adopt mixed strat- 
egies which minimize the maximum 
expected loss, then the game will al- 
ways have a saddle-point. Thus each 
person will get, in the long run, his 
expected loss, and will have no in- 
centive to change his behavior even 
if he should discover what his op- 
ponent’s mixed strategy is. Since A is 
already getting the minimum possible 
under the strategy he chose, any 
change in strategy by B will only in- 
crease A's payoff, and therefore cause 
B to gain less or lose more than he 
would by his own minimax strategy. 
The same is true of B. 

Games involving more than two 
people introduce a new element—the 
possibility that two or more players 
will cooperate to beat the rest. Such 
a cooperative agreement is called a 
coalition, and it frequently involves 
side-payments among members of the 
coalition. The method of analysis for 
three-or-more-person games is to con- 
sider all possible coalitions and to 
solve the game for each coalition on 
the principles of a two-person game. 
This works fairly well for three-per- 
son games, but gets more complicated 
and less satisfactory for still more 
people. 

This is the end of this exposition of 
the content of von Neumann and 
Morgenstern’s book. It is of course 
impossible to condense a tremendous 
and difficult book into one page. The 
major points to be emphasized are 
these: the theory of games is not a 
model of how people actually play 
games (some game theorists will dis- 
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J 
agree with this), nor is it likely to be 
of any practical use in telling yog 
how to play a complicated game; the 
crux of the theory of games is the 
principle of choosing the strategy 
which minimizes the maximum ex 
pected financial loss; and the theory 
defines a solution of a game as a set 
of imputations which satisfies this 
principle for all players. 

Assumplions. In their book vom 
Neumann and Morgenstern say “We 
have...assumed that [utility] i 
numerical . . . substitutable and une 
restrictedly transferable between the 
various players.” (197, p. 604.) Game. 
theorists disagree about what this 
and other similar sentences mean. 
One likely interpretation is that they: 
assume utility to b- linear with the 
physical value of money involved im 
a game and to be interpersonally 
comparable. The linear utility curves 
seem to be necessary for solving two: 
person games; the interpersonal com- 
parability is used for the extension to 
n persons. Attempts are being made: 
to develop solutions free of these a& 
sumptions (176). 

Statistical decision functions. Von 
Neumann (195) first used the mini 
max principle in his first publication 
on game theory in 1928. Neyman 
and Pearson mentioned its applic- 
ability to statistical decision prob- 
lems in 1933 (144). Wald (198), who 
prior to his recent death „was the 
central figure in the statistical deci" 
sion-function literature, first seriously 
applied the minimax principle to stà- 
tistical problems in 1939. Appa 
ently, all these uses of the princ! e 
were completely independent of one 
another. 

After Theory of Games and Ew. 
nomic Behavior appeared in 194, 
Wald (198) reformulated the problem 
of statistical decision making as 0€ 
of playing a game against Nature 
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The statistician must decide, on the 
basis of observations which cost 
| something to make, between policies, 
each of which has a possible gain or 
los, In some cases, all of these gains 
and losses and the cost of observing 
can be exactly calculated, as in in- 


dustrial quality control. In other 
cases, as in theoretical research, it is 
necessary to make some assumption 
about the cost of being wrong and the 
gain of being right. At any rate, when 
they are put in this form, it is obvious 
that the ingredients of the problem of 
statistical decision making have a 
gamelike sound. Wald applied the 
minimax principle to them in a way 
essentially identical with game the- 
ory. 
A very frequent criticism of the 
minimax approach to games against 
Nature is that Nature is not hostile, 
as is the opponent in a two-person 
game. Nature will not, in general, 
use a minimax strategy. For this 
reason, other principles of decision 
making have been suggested. The 
simple principle of maximizing ex- 
pected utility (which is the essence of 
the Bayes's theorem [15, 198] solution 
of the problem) is not always applica- 
ble because, even though Nature is 
Not hostile, she does not offer any 
Way of assigning a probability to each 
Possible outcome. In other words, 
Statistical decision making is a prob- 
lem of uncertainty, rather than of 
risk, Savage has suggested the prin- 
ciple of minimaxing regret, where re- 
gret is defined as the difference be- 
tween the maximum which can be 
one under any strategy given a 
^ Cei state of the world and the 
ad ount gained under the strategy 

opted. Savage believes (170, also 
EOM communication) that neither 
Wald eumann and Morgenstern nor 
à Actually intended to propose 

€ principle of minimaxing loss; they 
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confined their discussions to cases in 
which the concepts of minimax loss 
and minimax regret amount to the 
same thing. Other suggested princi- 
ples are: maximizing the maximum 
expected gain, and maximizing some 
weighted average of the maximum 
and minimum expected gains (93). 
None of these principles commands 
general acceptance; each can be 
made to show peculiar consequences 
under some conditions (see 170). 

Experimental games. The concepts 
of the theory of games suggest a new 
field of experimentation: How do 
people behave in game situations? 
Such experimentation would center 
on the development of strategies, par- 
ticularly mixed strategies, and, in 
three-or-more-person games, on the 
development of coalitions and on the 
bargaining process. You should re- 
member that the theory of games 
does not offer a mathematical model 
predicting the outcomes of such 
games (except in a few special cases) ; 
all it does is offer useful concepts and 
language for talking about them, and 
predict that certain outcomes will 
not occur. 

A few minor experiments of this 
kind have been conducted by Flood, 
a mathematician, while he was at 
Rand Corporation. He usually used 
colleagues, many of whom were ex- 
perts in game theory, and secretaries 
as subjects. The general design of 
his experiments was that a group of 
subjects were shown a group of de- 
sirable objects on à table, and told 
that they, as a group, could have the 
first object they removed from the 
table, and that they should decide 
among themselves which object to 
choose and how to allocate it. In the 
first experiment (64) the allocation 
problem did not arise because enough 
duplicate objects were provided so 
that each subject could have one o 
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the kind of object the group selected. 
The subjects were Harvard under- 
graduates, and the final selection was 
made by negotiation and voting. In 
the second experiment (65), in which 
the subjects were colleagues and sec- 
retaries, a long negotiation process 
eliminated some of the objects, but a 
time limit forced a selection by lot 
from among the rest. Further negoti- 
ations to solve the allocation problem 
were terminated by a secretary, who 
snatched the object, announced that 
it was hers, and then tried to sell it. 
No one was willing to buy, so the ex- 
periment terminated. Other experi- 
ments (66, 67) showed that coalitions 
sometimes form, that a sophisticated 
subject could blackmail the group for 
an extra side-payment by threatening 
to change his vote, and that the 
larcenous secretary, having succeeded 
once, had to be physically restrained 
in subsequent sessions to prevent 
more larceny. The general conclusion 
suggested by all these experiments is 
that even experts on game theory are 
less rational and more conventional 
than game theory might lead experi- 
menters to expect. . 
Psychological comments. The most 
nutritive research problems in this 
area seem to be the social problems of 
how bargaining takes place. Flood’s 
experiments left bargainers free and 
used physical objects, whose utilities 
probably vary widely from subject to 
subject, as stimuli to bargain over. 
This is naturalistic, but produces 
data too complex and too nonnumeri- 
cal for easy analysis. A simpler situa- 
tion in which the possible communi- 
cations from one bargainer to an- 
other are limited (perhaps by means 
of an artificial vocabulary), in which 
the subjects do not see one another, 
and in which the object bargained 
over is simple, preferably being 
merely a sum of money, would ‘be 
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better. Physical isolation of on 
ject from another would make 
sible to match each subject ag 
standard bargainer, the experi 
or a stooge, who bargains by d 
set of rules that are unknown to 
subject. Flood (personal comm 
tion) is conducting experim 
this sort. For three-or-more- 
games, Asch's (16) technique o 
a group consisting of only on 
subject and all the rest stooges m 
well be used. It would be inte 
for instance, to see how the pi 
ity of a coalition between two pla 
changes as the number and po 
players united against them inci 
The theory of games is th 
among those described in this 
in which the uncontrolled and 
ally planned ''pilot experim 
most likely to occur. Such 
ments are at least as dangerous! 
as they are in the area of ris 
cision making. Flood's results 
gest that it is especially importa 
use naive subjects and to use 
only once, unless the effects of e 
ness and experience are the 
concern of the experiment. 


SUMMARY i 
bd 


For a long time, economists and 
others have been developing ath 

matical theories about how 
make choices among desirable 


that is, that they have transm 
preferences and that they cho 
such a way as to maximize ut 
expected utility. 

The traditional theory of 
choices, a straightforward thet 
utility maximization, was chal 
by the demonstration that then 
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matical tool of indifference curves 
made it possible to account for risk- 
less choices without assuming that 
utility could be measured on an in- 
terval scale. The theory of riskless 
choices predicted from indifference 
curves has been worked out in detail. 
Experimental determination of indif- 
ference curves is possible, and has 
been attempted. But utility meas- 
ured on an interval scale is necessary 
(though not sufficient) for welfare 
economics. 

Attention was turned to risky 
Choices by von Neumann and Mor- 
genstern's demonstration that com- 
plete weak ordering of risky choices 
implies the existence of utility meas- 
urable on an interval scale. Mosteller 
and Nogee experimentally deter- 
mined utility curves for money from 
gambling decisions, and used them to 
predict other gambling decisions. 
Edwards demonstrated the existence 
of preferences among probabilities in 
gambling situations, which compli- 
Cates the experimental measurement 
of utility. Coombs developed a model 
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for utility and subjective probability 
measured on an ordered metric scale, 
and did some experiments to test im- 
plications of the model. 

Economists have become worried 
about the assumption that choices 
are transitive. Experiments have 
shown that intransitive patterns of 
choice do occur, and so stochastic 
models have been developed which 
permit occasional intransitivities. 

'The theory of games presents an 
elaborate mathematical analysis of 
the problem of choosing from among 
alternative strategies in games of 
strategy. This paper summarizes the 
main concepts of this analysis. The 
theory of games has stimulated in- 
terest in experimental games, and a 
few bargaining experiments which 
can be thought of in game-theoretical 
terms have been performed. 

All these topics represent a new 
and rich field for psychologists, in 
which a theoretical structure has al- 
ready been elaborately worked out 
and in which many experiments need 
to be performed. 
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Part I 


Tue SOCIAL PSYCHOLOGICAL 
DATA 


Five years after the report on men, 
the publication of Sexual Behavior 
in the Human Female by Kinsey and 
his collaborators provides the com- 
panion results on women. It provides 
the greatest wealth of data ever ac- 
cumulated on the sexual behavior of 
a particular population, American 
women, and therefore is a factual con- 
tribution of great interest to psychol- 
ogists. To social psychologists and 
those interested in the relations of 
man to society, the new volume un- 
` derscores and rounds out a basic find- 

„ing of the first volume—namely, 
the widespread occurrence of a vari- 
ety of unsanctioned forms of sexual 
behavior both among males and fe- 
males, and the disparity between 
societal norms and such behavior. 

While the novelty of this finding 
may have worn off slightly, the new 
volume has other compensating fea- 


1 Kinsey, A. C., Pomeroy, W. B., MAr- 
TIN, C. E., & GEBHARD, P. H. Sexual be- 
havior in the human female. Philadelphia: 
Saunders, 1953. Pp. xxx+842. $8.00. 

2 This multiple review was divided so that 
H. Hyman was concerned with Part I, the 
social psychological data, and J. E. Barmack 
confined himself to Part II, the biological 
material. The reviewers worked independ- 
ently of each other. 
wa The reviewer wishes to acknowledge his 
indebtedness to Paul B. Sheatsley, with whom 
he collaborated on a longer essay on the 
Kinsey reports (cf. GEppEs, D. P. [Ed.] 
An analysis of the Kinsey reports on sexual be- 
havior in the human male and female. New 
York: New American Library of World Lit- 
erature, and E. P. Dutton, 1954). 
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tures. The simple fact that it is the 
second report suggests that the re- 
searchers may well have benefited 
from their past experience, with 
consequent improvements in the 
quality of their findings. This not 
only provides opportunity for im- 
provement of methods, but it also 
permits formerly neglected areas of 
inquiry to be covered. Thus, in the 
earlier volume the emotional and atti- 
tudinal accompaniments of various 
patterns of behavior were hardly 
touched. We shall note below that 
considerable information of this type 
has been added, making the volume 
more relevant to psychologists. 

In addition, the new volume is not 
merely a descriptive study like the 
first; Kinsey now can compare the 
behavior of males and females. The 
comparative profiles of the v 
behavior of men and women provide 
evidence on the degree of “comple 
mentarity" of partners. It is such 
data that are relevant to profound 
questions of conflict in marital and 
interpersonal relations, rather than 
separate descriptive data of the be- 
havior of one group or the other. And 
much of the new volume exploits these 
materials; each chapter ends 
a comparison of men and women, 

a special chapter is devoted to gen! 

psychological differences between the 
sexes. Further, by parallel analyses 
of the determinants of sexual patterns 
in men and women, there emerge 
data on the differential and lesser 1m7 
pact of social factors on the sexua 
development of women—a particu 
larly relevant body of materials for 
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studies of socialization. Parallel to 
these findings are other new materials 
relevant to general problems of learn- 
ing theory and psychological de- 
velopment. In both volumes, sexual 
behavior was examined in relation to 
yarious group memberships. Butnow, 
supplementary analyses of a more dy- 
namic sort are introduced. Kinsey 
studies the relation of the individual's 
early sexual patterns to his subse- 
quent sexual behavior and adjust- 
ment. Obviously, such information 
is valuable both to clinicians and 
theorists. 

The substantive contributions out- 
lined above can make the new vol- 
ume of great importance, provided 
the methods employed were such as 
to yield sound findings. It is to the 
question of methods therefore that 
this review will be directed. 

This question cannot be answered 

fly or in any simple way. The 
official committee appointed by the 
American Statistical Association to 
feview the statistical methods em- 
Ployed in the first Kinsey report 
not issue its report until three 
Years after it had been established as 
à committee, With such distin- 
guished members as W. G. Cochran, 
Frederick Mosteller, and John W. 
ukey, it not only examined the 
Published work, but it also spent a 
Period of time at the Institute at 
"diana University and engaged in 
"tain additional researches and 
‘amination of other literature. The 
tailed evaluation will require mono- 
graphic publication, and the sum- 
Mary findings will require some 43 
IN of journal treatment. It is of 
ote that this expert, judicial, and 
T'àsonable body remarks: 

It would have been possible to write two 
lt ually Correct reports, one of which would 
Kpy. © impression with the reader that 

5 work was of the highest quality, the 
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other that the work was of poor quality and 
that the major issues were evaded. We have 
not written either of these extreme reports 
(1, p. 674). 


Evaluation of the methods will 
require a brief discussion of each 
stage in the new inquiry: the nature 
of the sample of 5,940 white Ameri- 
can females who provide the major 
data; the quality of the interviewing; 
the adequacy of the coverage of the 
interview schedule and its constrüc- 
tion; and analysis of the data. We 
must also allude to the methods in 
the earlier volume. Since much of the 
significance of the new work derives 
from the comparative data on male 
and female, one must examine these 
data in an attempt to discover 
whether or not the differences demon- 
strated are an artifact of some change 
in method. 

One reasonable standard to apply 
to this volume is the level of improve- . 
ment in methods, as compared with ' 
the earlier report. The novelty of 
the earlier investigation was bound 
to create error. But the many criti- 
cisms of the first volume, and the 
additional experience, should have 
led to considerable improvement. 
That this is the case seems beyond 
question. Certain major defects have 
been remedied. Thus, there is much 
greater attention to error and more 
explicit reference to limitations. 
However, it should be noted that 
there are limits to the amount and 
kinds of improvement that could be 
achieved in the second volume. While 
the publication date may suggest 
that the new work had the benefit 
of some years of additional experi- 
ence, this is clearly not the case. 
Both inquiries started in 1938 and 
"throughout the years, female his- 
tories have been added at approxi- 
mately the same rate as the histories 
of males." 'Thus, Kinsey's sampling 
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and interviewing procedures cannot 
have been altered appreciably. What 
is subject to change are the analytic 
procedures since, in the processing 
and reporting of female data, Kinsey 
had the benefit of additional time. 
However, even here changes cannot 
be made lightly. For the compara- 
tive findings on males and females to 
be valid, the methods must be com- 
parable. Bearing in mind such limi- 
tations, we can clearly see that Kin- 
sey has made considerable improve- 
ment. 

The sample. Critics of the first 
volume questioned Kinsey's general- 
izations to the whole American male 
population from his sample data. 
While they varied with respect to the 
inadmissibility of certain procedures, 
there was much agreement that the 
generalizations departed consider- 
ably from proper caution. In the 
present volume, Kinsey is cautious 
about generalizations, and he makes 
no attempt to extend his findings to 
all American women. A telling criti- 
cism of the earlier report was that 
no reader could determine just how 
representative of any particular uni- 
verse the sample was, because no- 
where was there any systematic ac- 
count of the distribution of the 5,300 
males in terms of standard char- 
acteristics. This defect has now been 
remedied by means of two elaborate 
tables which present detailed infor- 
mation on the composition of the 
5,940 white females. 

While Kinsey has this time pro- 
vided us with these two tables and 
has carefully refrained from making 
unsupported generalizations, it can 
be demonstrated that the sample does 
not accurately represent the white 
female population of the nation. 
Some critics may condemn the whole 
investigation, arguing that a better 
method could have been employed— 
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one which would have permitted un- 
qualified statements about the popu- 
lation as a whole. The question is de- 
batable. Kinsey argues that any 
scientific sampling of predesignated 
respondents would have been impos- 
sible because a high proportion of such 
selected individuals could not be 
counted upon to consent to the inter- 
view and to answer the questions 
truthfully. In the present volume, 
all respondents were “volunteers” 
in the sense that they allowed them- 
selves to be interviewed. However, 
histories were not accepted simply 
because a subject expressed the desire 
to be interviewed; all the respondents 
were selected because of their im- 
portance to the over-all sampling 
plan. Kinsey works primarily 
through formal or informal groups 
and makes ingenious and skilful use 
of social pressures. He describes some 
group contacts which required a year 
or two of cultivation. Indeed, the im- 
mense diversity of groups he has 
worked with is a tribute to his per- 
sistence and skill. 

Nevertheless, one must still raise 
grave questions concerning the sam- 
ple. For one thing, a substantial por- 
tion of the U. S. population claims 
no organizational membership what- 
ever. Such individuals would likely 
be excluded from the samples, and 
there is evidence that they differ in 
many respects, perhaps including sex- 
ual behavior, from those who do belong 
to groups and associations. In addi- 
tion, Kinsey could seldom interview 
all or even nearly all of the members 
of any particular group. No figures 
are given on the proportion of "re- 
fusals" to be interviewed, though 
Kinsey does state that 15% of the 
female sample is composed of “100% 
groups"—that is, groups in which 
every member was interviewed— 
and "we may report that a consider- 


able proportion of the rest of the sam- 
has been drawn from groups in 
which something between 50 and 
90 per cent of all the members had 
contributed histories" (p. 30). Kin- 
sey further states: “Such coverage 
should provide a good sample of 
those particular groups" (p. 30), 
but this is a dubious assumption. 
There is no reason to believe that the 
histories of the 10-50% who do not 
contribute would necessarily agree 
with those who do. A principal 
finding of the Kinsey reports (to 
many, their most controversial and 
significant finding) has been the high 
incidences and frequencies of both 
sanctioned and unsanctioned forms 
of sexual behavior reported by both 
men and women. A speculation 
which cannot be answered concerns 
the possibility that those women who 
agree to an interview might be less 
inhibited and consequently may en- 
gage in more different types of sexual 
activity, and may do so more fre- 
quently. : 

Aside from this consideration, it is 
puzzling that Kinsey's sample is not 
more nearly representative of the 
Population in respect to a number of 
gross characteristics. Perhaps the 
most striking deficiency is the failure 
to interview enough females whose 
education did not extend beyond 
grammar school. Some of the most 
interesting differences in the sexual 
behavior of the males studied were 
those attributed to education, and 
one would have expected compara- 
tive data in the present study. In- 
Stead, only 3% of the women inter- 
Viewed had not attended high school, 
and Kinsey is restrained from making 
any substantial statements about 
t 1$ important stratum of the popu- 
lation. On the other hand, 75% of 
the total female sample had attended 
College, and 19% had gone on to 
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postgraduate work. The significance 
of Kinsey's findings must be con- 
siderably diluted when we consider 
that three-fourths of his sample 
came from the 13% of American wom- 
en who have gone to college, and 
that the 40% of American women 
who never went beyond the eighth 
grade comprised only 3% of those he 
studied. Similarly, the sample is 
deficient in the older age groups and 
is heavily weighted with women in 
their 30's. Some of these sampling 
biases have been deliberate in order 
to build up sufficient cases among 
particular subgroups for special 
analyses. Thus, although Jews repre- 
sent only about 4% of the U. S. popu- 
lation, they account for more than 
one-quarter of Kinsey's sample. In 
order to make statements about re- 
ligion and sexual behavior, Kinsey 
did require sufficient Jews to allow 
thorough analysis of this religious 
group. In the case of the age and 
education biases, however, this rea- 
son does not seem relevant, and one 
can only wonder at the failure to 
achieve more adequate representation 
in terms of these two important fac- 
tors. 

Obviously one should not dismiss 
Kinsey's study simply because he 
cannot speak authoritatively about 
every group in the population. How- 
ever, one feature of the present vol- 
ume does raise a new question about 
the sampling method; that is the 
comparative analysis of male and 
female behavior. Many statistical 
data demonstrating sex differences 
are presented, and their impact is 
powerful. But a problem that con- 
tinually arises in the interpretation 
of these findings, and one that is 
not stressed by Kinsey, is whether 
the differences represent true sex 
differences or merely differential sam- 
pling biases in the two sets of data. 
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If the biases were equivalent for 
both men and women, the stated 
differences are nonetheless true, 
though limited in generality. But if 
the male sample is biased in one re- 
spect (e.g., too many uneducated) 
and the females in an opposite re- 
spect (too many educated), then the 
interpretation of any differences 
found in the data becomes very diffi- 
cult. Since the over-all composition 
of the male sample was never re- 
ported in the first volume, the facts 
cannot be readily established. At- 
tempts to infer this composition from 
scattered tables suggest that some of 
the sex differences may be artifacts of 
differential sampling biases. Thus, 
almost 30% of the women are Jewish, 
while the best guess for the men 
would place only about 15% in that 
class. Only 3% of the women are in 
the grammar school group, but in 
the male sample this proportion is 
much larger. If religious affiliation 
and education are significantly cor- 
related with both male and female 
sexual behavior, the differential 
biases in the two samples could well 
explain some of the aggregate sex 
differences reported. 

However, the criticism is not as 
serious as it might seem. Kinsey pre- 
sents much evidence that female 
sexual behavior is less affected by 
many of the social characteristics 
studied than is male behavior. Thus, 
biases with respect to these factors 
could not account for very much of 
the difference reported. Further- 
more, Kinsey presents in summary 
form in the female volume, and in 
detail in the two separate volumes, 
the male and female findings for vari- 
ous subgroups, so that the careful 
reader can estimate the degree to 
which such biases may affect the over- 
all differences cited. Nevertheless, 
any differential biases with respect to 
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characteristics other than those 
lated might still remain as an 
known source of error in the coi 
parisons. Thus, Kinsey states that 
female sexual behavior is subject 
to much greater variability than the 
male. Since some other unknown: 
characteristics must determine the 
variation among females, any such” 
differential sampling biases might 
well contribute to a spurious finding: 
Kinsey does present some empirical 
evidence bearing on the gener 
problem when, for example, he shows 
a close agreement between the male 
and female findings with respect to 
marital coitus. The frequencies € 
marital coitus would, by definiti 
have to be identical in the aggreg 
for the two samples if the sample 
were equivalent. In many instance 
a very high degree of agreement 
reported, but for some few aspe 
of behavior the reports of the 
samples do not closely conform. The 
speculation is not generally wa 
ranted, but some limited number oL 
the reported sex differences may V 
reflect the net operation of all the 
differential biases in the sampling oF 
the two groups. u 
The interview schedule and the inter: 
viewers. As in the case of the males 
approximately two hours were spen 
interviewing each subject and cover 
ing between 300 and 500 items 0° 
information. No questionnaire We 
presented in either volume. 
effectively prevents one from 
praising the types of questions 
Kinsey asked and the manner P 
which they were presented. Kinsey 
argues persuasively against the be 
lief that "standard questions 
through diverse human machines € 
bring standard answers," and he use 
no questionnaire at all in the ust? 
sense of the word. The items to * 
covered and the definition of ea© 
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item were standard, but the wording 
and order of the particular questions 
were varied in the most meaningful 
manner for each respondent. Since 
the objective conditions of the inter- 
view were not uniform, one can never 
be certain that the differences found 
between individuals and groups are 
not due, at least in part, to differences 
in the wording and order of the ques- 
tions. This consideration takes on 
added weight in connection with the 
many comparisons between males 
and females. Here, the question of 
consistency of interviewing procedure 
might be raised, but unfortunately 
can never be answered. 

Offsetting this possible danger was 
the fact that all of the interviewing 
was done by a very small staff of 
highly trained interviewers. Eighty- 
five per cent of the male interviews, 
and approximately 80% of the fe- 
male, were collected by only two 
interviewers. Only six interviewers 
were used for the male volume, and 
only four for the female study. Each 
of these interviewers was thoroughly 
trained and in continuing contact 
with the others. When their years of 
experience in handling interviews 
With every sort of individual are 
considered, one can assume that 
procedures were more uniform than 
might appear, in spite of the large 
measure of freedom to exercise judg- 
ment from case to case. 

Kinsey presents a variety of evi- 
dence bearing on the quality of the 
interview data. One hundred and 
twenty-four females were reinter- 
Viewed after a minimum time lapse 
of 18 months. Compared with other 
Studies of the reliability of interview 
feports over short spans of time and 
On issues much less emotional than 
Sexual behavior, the extent of agree- 
Ment between the first and second 
Interviews js remarkably high (for a 
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summary of such studies, see 2). 
Validity of response is examined by 
comparing the results obtained from 
separate interviews with 706 pairs of 
spouses; here, by definition, any dis- 
agreement in reports on marital his- 
tory and behavior would constitute 
invalidity. Again, the general find- 
ings are good and, compared with 
validity studies in the past interview 
literature, they are often amazingly 
good, (see 3 for a general summary). 
It should be noted, however, that 
such validity checks have been con- 
fined to spouses whose joint sexual 
behavior is more or less "sanctioned." 
Whether or not the same degree of 
validity holds for less sanctioned 
areas of behavior, such as the homo- 
sexual or extramarital, is not demon- 
strated. 

One consideration in evaluating the 
findings was the fact that all of the 
interviewers were males. There is 
considerable evidence for many types 
of studies that the group member- 
ship (e.g., color, class) of the inter- 
viewer may seriously affect the re- 
sponses. And men and women have 
been observed to give different re- 
plies, on many questions, depending 
upon the sex of the interviewer (cf. 
2). True, such studies have also 
shown that the more experienced 
and capable interviewers have been 
able to some extent to overcome the 
effects, and we would expect such 
effects to beat a minimum with inter- 
viewers of the caliber of Kinsey and 
his associates. But it has been estab- 
lished that statistical results can vary 
by as much as 10 percentage points, 
depending upon the sex of the inter- 
viewer, and one has no way of meas- 
uring the possible effects in this 
study. 

The quality of the reports. A variety 
of criticisms of Kinsey’s research have 
been advanced, and will continue to 
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be advanced, on sheer axiomatic 
grounds. For example, the claim is 
made that data concerning the most 
intimate of human experiences, a 
portion of which may be socially 
disapproved or remote in time, can- 
not, when collected by straightfor- 
ward question-and-answer proce- 
dures in a relatively short interview, 
possibly be accurate. 

With respect to criticisms based on 
axiomatic grounds, however, it is 
proper that the burden of proof rest 
on the critics. Kinsey himself cites 
considerable empirical support for 
the quality of his material. Relia- 
bility is high, as measured by re- 
interviews after a considerable lapse 
of time. Agreement in the independ- 
ent reports of spouses concerning 
their joint behavior is high. There is 
some evidence that even events 
which occurred many years ago and 
are remote in the respondent's mem- 
ory are nevertheless enumerated with 
considerable accuracy. Thus, Kin- 
sey's findings on age of menarche, 
and on other indices of physical de- 
velopment, as reported retrospec- 
tively by his respondents, agree 
closely with independent studies 
based on direct observation of these 
matters. High agreement is demon- 
strated between many of his findings 
and the findings of earlier independ- 
ent investigators who relied on differ- 
ent techniques. The material from 
the reinterviews and from the inter- 
views with spouses concerning mat- 
ters remote in time is somewhat less 
reliable than their reports of more 
recent events, but is still surprisingly 
good. 

Terman, in his distinguished re- 
view of the earlier volume, noted 
that Kinsey had not taken proper 
account of possible memory errors 
in the way he processed and analysed 
his data (4). Terman did not reject 
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the material collected by remote res 
call, but argued against Kinsey's 
procedure of describing sexual be- 
havior at given ages by lumping the 
reports of respondents actually at | 
that age at the time of interview: 
with the reports of those who had. 
to recall the same period from their 
early lives. 


In the computation of mean frequency of 
masturbation at age 15, for example, the 
memory report of a 50-year-old counts as 
heavily as the report of a 15-year-old . « 
It would have been helpful if he had shown in 
the tables what proportion of the N at that , 
given age level was accounted for by subjects 
at or near that age (4, pp. 446, 450). 


] 
In the present volume, Kinsey 
again lumps retrospective and cure 
rent reports, but he has this time pro- 
vided basic tables showing the age 
distribution at time of report, not 


only for his total sample of females, 
but also for particular subgroups. 
The reader can now weigh for him 
the vulnerability of certain of the 
data in the light of possible memory 
distortions. i 

Types of data collected. Critics have 
often concerned themselves with 
Kinsey's definition of the domain of 
sexual behavior and his emphasis on | 
orgasm as the unit of measurement of 1 
such behavior. By and large, Kinsey , 
has approached his problem in these 
terms. However, it seems proper to 
distinguish legitimate and explicit 
research limitations from errors which 
merit criticism. Kinsey frequently 
acknowledges that measures such as 
orgasm do not encompass the entire 
domain of the problem, but he argues 
persuasively that such measures were — 
the best he could accomplish at this 
time. Stronger criticism, howevey — 4 
may be made when conclusions OVéf* — 
reach the definition of the domain. 4 
and the character of his data. Kinsey 


may be somewhat vulnerable on ij 
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score. In his first volume, virtually 
no direct data were collected on the 
psychological processes underlying 
the reported overt male sexual be- 
havior, and without such data some 
of the findings were uninterpretable. 
In the present volume, wherein Kin- 
sey seems even more concerned with 
interpreting the underlying meaning 
of his findings, with commenting on 
psychological processes as they dif- 
ferentiate males and females, and 
with drawing recommendations for 
social action, there is all the more 
need for such auxiliary data. Kinsey 
has become acutely aware of this 
need. In contrast to the earlier re- 
port, the present study makes con- 
siderable use of attitudinal and psy- 
chological data collected directly 
from the interview. A few illustra- 
tions will show how the new psycho- 
logical material is a basis for better 
inferences than the former bare rec- 
ord of overt behavior. Following a 
discussion of homosexual behavior, 
for example, there is a tabulation of 
the degree of regret, if any, that the 
Subjects felt concerning their homo- 
Sexual experience. Following a dis- 
cussion of masturbation, evidence is 
Presented on the degree to which 
Psychological disturbance had ac- 
companied the activity. Following 
à discussion of sexual contacts be- 
tween adults and children, informa- 
tion Js cited on the psychological 
Teactions of the children. 

In addition to these new data, 
there is acute use of auxiliary materi- 
als from an immense variety of 
Sources in other fields. Anthropologi- 
cal, biological, psychological, medical, 

'storical, and literary data on sexual 
poster are profusely and effectively 
ea to Support the findings and in- 
E pretations drawn from Kinsey's 

Wn original materials. The ex- 
Panded use of such supplementary 
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data in this current report appears to 
be a major contribution toward the 
placement of Kinsey's work in per- 
spective. 

Analytic techniques. The analytic 
procedures employed for the female 
report conform closely to the pattern 
established in the first volume. In 
general, sexual behavior is described 
by the incidence and frequency of 
any experience, and experience lead- 
ing to orgasm. These are considered 
under the following defined types of 
sexual activity: preadolescent sex 
play, masturbation, nocturnal dreams, 
heterosexual petting, premarital coi- 
tus, marital coitus, extramarital coi- 
tus, postmarital coitus, homosexual 
contacts, and animal contacts. The 
individual figures for each type of 
activity, and the “total sexual out- 
let" from all types, are then explained 
by examination of the variations in 
such behavior according to age, 
religion, educational status, marital 
status, etc. These explanatory fac- 
tors are the same ones used in the 
analysis of the male data, and the 
approach proves fruitful, for the 
differences reported from one group 
to another greatly increase our un- 
derstanding of the factors affecting 
sexual behavior. 

These statistical analyses are han- 
dled carefully. Kinsey examines his 
findings for spuriousness and system- 
atically controls factors that might 
be correlated with the characteristic 
under study. When he has insuf- 
ficient cases to pursue such analyses, 
he is careful to qualify his conclu- 
sions. Similarly, he is cautious about 
imputing causality. 

Presentation of the detailed break- 
downs by subgroups has additional 
values. First, it enables the reader 
to appraise the probable effect of the 
sampling biases which have been 
noted above. It is on this basis that 
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Kinsey argues that the heavy pre- 
ponderance of college women is not 
overly serious, since, in general, he 
finds little difference among the high 
school, college, and graduate school 
groups. He does, however, state 
that "major changes might have 
been introduced into the generaliza- 
tions if we had had a larger sample 
of females who had never gone be- 
yond grade school" [italics mine] (p. 
51). 

The use of the same social char- 
acteristics to examine the sexual ac- 
tivity of both males and females en- 
ables the reader to compare not only 
the differences between men and 
women in the ageregate, but also 
differences between men and women 
within specific subgroups, e.g., edu- 
cated, Catholic, etc. Thisisobviously 
a much more precise approach to the 
evaluation of what is a major social 
psychological contribution of Kin- 
sey's second report—the analysis of 
social conflicts which may arise be- 
cause of the discrepant sexual pat- 
terns of male and female partners. 
In marriage and in social life gener- 
ally, people normally interact with- 
in limited social groupings. Con- 
sequently, the important facts to be 
adduced are the patterns among men 
and women who are likely to meet, 
rather than the patterns for men and 
women in general. 

Although these parallel analyses 
by the same social groupings are 
therefore potentially valuable, Kin- 
sey does not seem to make sufficient 
use of them for their obvious purpose. 
Most of the conclusions about social 
conflicts arising from discrepant sexu- 
al behavior in men and women seem 
to be predicated on the aggregate 
samples. For example, Kinsey finds 
that the total number of men who at 
one time or another engaged in pre- 
marital coitus approximates the level 
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of 90%, while among women tl 
figure is 50%. He uses this dig 
crepancy in discussing the rise of th 
double standard and regards 
greater sexuality of young males 
compared with young females, 
creating social strains. But when 
incidence of premarital coitus 
examined for separate educational) 
levels, the figure drops to 68% among — 
the college-educated men and rises tg 
60% among the similarly educated” 
women. Within this particular group, 
therefore, the sex differe:ce tends ta 
disappear.  Unfortunatefy, Kins 
generally tends to rest his conclu: 
sions on the characteristics of the 
total samples. It should also be noted 
that while it is theoretically possible 
to make detailed subgroup compart 
sons between the sexes, the task i 
extraordinarily difficult for a readet 
While parallel use of the same 80> 
cial characteristics in the analysis of 
the behavior of both sexes has very” 
substantial value, it is strange d 
the female analysis should have beem 
so rigidly restricted to those factors” 
alone. There are other social char 
acteristics of American women pre 
sumably central to their behavior 
which are not comprehended at al 
under the traditional groupings th 
were used for the male. Obvio 
factors in female behavior are t 
presence or absence of motherhoo 


independent wage earner as 
trasted with the role of dependet 
nonworking wife or daughter. 
ceivably, these factors have no 
tion to sexual behavior, but the 
sumption that they do is strong 
is surprising that such analyses ? 
not reported in the present volum 
At the outset, it was empha 
that it is not easy to make a simi 
evaluation of Kinsey's new WO! 
The report represents, in this re 
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viewer's judgment, a considerable 
improvement over the first volume. 
Like all investigations, it has limita- 
tions and weaknesses but, as a pio- 
neer investigation of such scope into 
a very difficult research problem, it 
represents an. unusual achievement. 
Certainly, one cannot accept the 


427 


findings as the ultimate, definitive 
answer to problems of sexual be- 
havior, but the discrepancy from this 
research ideal is small. Kinsey's 
new volume should be regarded as a 
monumental contribution for which 
all psychologists can be grateful. 
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Part II 
Tur BrorLocrcar DATA 


: Kinsey and his associates present 
in the third and final part of this 
volume a comparison of the sexual 
activity of the male and female 
"...to discover some of the basic 
factors which account for the similari- 
ties and the differences between the 
two sexes" (p. 567). 

The comparison is not restricted to 
the data reported in the first volume 
and in the earlier sections of the pres- 
entone. Instead, it also includes sup- 
plementary data and a survey of the 
literature. The survey is selective 
and interpretive, and it represents 
the authors' conception of the nature 
of sex differences rather than a mere 
identification of what is known in 
this field. By their laborious and 
extensive study of one of the most 
important aspects of human activity, 
Kinsey, Pomeroy, Martin, and Geb- 
hard have earned the interest of a 
Wide audience in what they have to 
Say about sexual life. Their state- 
ments need not be supported by any 


data, provided that their views are 
represented as theory, opinion, or 
conception. However, insofar as 
they claim to have demonstrated the 
validity of their conceptions, the 
reader is entitled to question whether 
or not this is indeed so. 

Their writing is skillful and engag- 
ing, but their style is more popular 
than scientific. They overgeneralize 
from their data. They take cogni- 
zance of opposing viewpoints or con- 
tradicting data, and then proceed 
to advance their own viewpoint 
in complete isolation. A surprising 
number of important terms remain 
undefined, e.g., psychological factors, 
basic sex differences, sexual capacity, 
and capacity to be conditioned, 
among others. A lack of conceptual 
clarity is the consequence. It is 
necessary to interpret what they 
mean, since the meaning is often not 
evident. A good example is the sum- 
mary statement in their chapter on 
anatomic factors in sexual response 
and orgasm. 

In brief, we conclude that the anatomic 
structures which are most essential to sexual 


pro 
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response! and orgasm are nearly identical in 
the human female and male... . If females 
and males differ sexually in any basic way, 
those differences must originate in some other 
aspect of the biology or psychology of the 
two sexes. They do not originate in any of the 
anatomic structures which have been con- 
sidered here (p. 593). 


The precise meaning of this state- 
ment is that the erotic topography 
of male and female genitalia will not 
account for the higher reported fre- 
quency of orgasm of male volun- 
teers. 

The authors have singled out differ- 
ence in orgasm frequency as the 
"basic" sex difference. They could 
have considered other differences as 
“basic,” e.g., gross anatomic differ- 
ences, differences in degree of ag- 
gression, or attitudes toward chil- 
dren. They did not, perhaps because 
they believed the most valid data 
they could obtain and interpret were 
reports about orgasms. Initially, 
orgasm was represented as the prime 
criterion of sexual life, but subse- 
quently became identified with all 
of it. 

There are still other difficulties 
with the authors' account of the basic 
similarities and differences between 
the two sexes, and these will be dis- 
cussed within the context of five 
main propositions. They propose 
that: (a) sexual responses depend 
upon à "basic" anatomy which is 
essentially the same in the female 
and the male; (b) the physiological 
accompaniments of orgasm are (with 
minor exceptions) basically the same 
for the two sexes; (c) males are more 
readily aroused by sexual stimuli 
because they have been conditioned 
more frequently than the female; 
(d) males are more frequently con- 
ditioned because certain unidentified 


* They define sexual response as those physi- 
ological changes which lead to orgasm (p. 594). 
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structural characteristics of the cere- 
bral cortex give them a greater 
capacity to be sexually conditioned 
than the female; (e) there are sex 
differences in changes of frequency 
of orgasm with age, the levels of the 
17-ketosteroids correlating with these 
differences, 

They dismiss the contribution of 
anatomic factors to an understanding 
of sex differences in orgasm frequency 
on the basis of the following argu- 
ments: (a) the clitoris is the embry- 
ological homologue of the penis and 
is erotically as sensitive; (b) the va- 
gina is relatively insensitive to touch 
and is therefore erotically unimpor- 
tant; (c) female homosexuals prefer 
clitoral stimulation, and they should 
know what is most stimulating; (d) 
the preferred female masturbatory 
technique involves clitoral rather 
than vaginal stimulation; (e) there is 
no evidence for sex differences in the 
distribution of end organs of touch 
and sensory nerves. 

'The argument of the embryologic 
homologue is hardly pertinent. The 
fact that genital differences are diffi- 
cult to identify in the embryonic 
stage is no guarantee that functional 
differences will not emerge at a later 
stage. The bills of the shrike and the 
hummingbird are homologous struc- 
tures, but function in quite different 
ways. 

'Their denial of erotic importance 
to the vagina is based, in part, on @ 
study in which they asked five gyne- 
cologists to test the tactile sensitivity 
of the clitoris and other parts of the 
genitalia of nearly nine hundred 
women. The gynecologists used a 
glass rod applied lightly to test touch 
sensitivity. According to this study, 
the vagina appeared relatively M- 
sensitive to touch. 

The -pertinence of such a study 
hinges on whether the equation © 
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tactile and erotic is acceptable. This 
equation is questionable for three 
reasons: 

1. Tactile stimulation is always 
available in clothed males and fe- 
males without chronic erotic arousal. 

2. Even tactile sensitivity may 
change when genital tissue is en- 
gorged by sexual excitement. There 
is no indication that the gynecologi- 
cal exploration was accompanied by 
sexual arousal. 

3. Subjectively, tactile and erotic 
sensations are different. 

The same data are also used to 
contradict the view advanced by 
Freud and others that the processes 
of maturing psychosexually involve a 
shift of the location of the dominant 
erotogenic zone from the clitoris to 
the vagina. This shift the authors 
believe to be biologically impossible 
(p. 584). The submitted evidence 
has questionable relevance since what 
is required is an age comparison to 
determine whether the clitoris re- 
mains the dominant erotogenic zone. 
No age comparison is provided, but 
rather they cite as additional support 
the preference for clitoral instead of 
vaginal stimulation by women en- 
gaged in homosexual and masturba- 
tory activities. "This type of testi- 
mony can only complicate rather than 
clarify a problem in which the critical 
independent variable is psychosexual 
maturity. 

Their whole discussion of anatomi- 
cal differences in the distribution of 
end organs of touch, of the density 
of sensory nerves in the penis and 
clitoris, in the size of the breast, 
and of other parts of the body has 
no bearing on an understanding of 
the sex difference in orgasm count 
Unless these anatomic factors can 
s Shown to affect orgasm-seeking be- 
avior differentially. This assump- 
tion is implied but not examined. 
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In view of the authors’ belief that 
the vagina is “of minimum impor- 
tance in contributing to the erotic re- 
sponses of the female" (p. 592), and 
since coitus necessarily involves vag- 
inal stimulation, it is curious that 
they did not conclude that anatomic 
factors contribute to sex differences 
in orgasm count. 

In discussing the physiology of 
sexual response and orgasm, they re- 
view certain muscular, vascular, and 
glandular responses before, during, 
and after orgasm. They base their 
review on a survey of the literature, 
references to some of their own inter- 
view data, and what is apparently a 
substantial amount of undocumented 
personal observation. While cited 
studies show rises in pulse rate, blood 
pressure, peripheral blood flow, forced 
respiration, salivary secretions, gener- 
al muscular activity, etc. during or- 
gasm, the evidence bearing on sex 
differences in these responses at the 
human level is variable and negligible. 
The authors conclude that female and 
male are quite alike as far as the data 
yet show in regard to all of these 
changes. While the cited physiologi- 
cal accompaniments of orgasm have 
intrinsic interest, their relevance to 
orgasm-seeking behavior is again as- 
sumed and not critically examined. 

The sex difference in speed of or- 
gasm receives special consideration as 
an attribute of physiological re- 
sponse. They state: 

There is a longstanding and widespread 
opinion that the female is slower than the 
male in her sexual responses'and needs more 
extended stimulation in order to reach or- 
gasm. ..« 

Certain it is that many males reach orgasm 
before their wives do in their marital coitus, 
and many females experience orgasm in only 
a portion of their coitus . . - but our analyses 
now make it appear that this opinion is based 
on a misinterpretation of the facts (p. 625). 


After comparing the speed of mas- 
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turbation of males and females, they 
state that '*...the female is not 
appreciably slower than the male in 
her capacity [italics mine] to reach 
orgasm" (p. 626). 

Then they conclude: “But because 
females are less often stimulated by 
psychological factors, they may not 
respond as quickly or as continuously 
as males in socio-sexual relation- 
ships" (p. 641). 

It is clear from the above that 
Kinsey and his collaborators are not 
disputing the existence of differences 
in speed of sexual response in a 
heterosexual relationship. Rather, 
they deny a difference in "capacity" 
to reach orgasm. Apparently, capac- 
ity is what is left after the influence 
of experience is separated out. This 
view emerges in the following quota- 
tion: 

For instance, the exceedingly rapid re- 
sponses of certain females who are able to 
reach orgasm within a matter of seconds from 
the time they are first stimulated, and the re- 
markable ability of some females to reach 
orgasm repeatedly within a short period of 
time are capacities [italics mine] which most 
other individuals could not conceivably ac- 
quire through training, childhood experience 
or any sort of psychiatric therapy. Similarly, 
it seems reasonable to believe that at least 
some of the females who are slower in their 
responses are not equipped anatomically or 
physiologically in the same way as those who 
respond more rapidly (p. 377). 


High-speed, high-frequency orgasm 
apparently represents a constellation 
of organic qualities which, like germ 
plasm, are relatively immune to the 
vicissitudes of experience. However, 
high-speed, high-frequency orgasm 
may be given a variety of psycho- 
logical interpretations including de- 
nial of homosexual feelings, feelings 
of inadequacy, and expressions of 
guilt or exhibitionism, among others. 
Accordingly, the reviewer believes it 
is unwarranted for the authors to 
dismiss sex differences in speed of 
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response by first separating psycho- 
logical factors from the performance, 
and then using speed of masturbation 
as the metric for sexual “capacity,” 

Having eliminated anatomical and 
physiological factors (exclusive of 
hormones) as an explanation of sex 
differences in orgasm frequency, the 
authors then ascribe the main role 
to psychological factors. Condition- 
ing carries the heuristic burden. Men 
are more often conditioned by their 
sexual experience (p. 649). This view 
is deduced from a sexual comparison 
of reports of arousal in thirty-three 
situations. These situations include 
observing the opposite sex, one's own 
sex, portrayals of nude figures, own 
genitalia, etc. In 29 out of 33 of these 
situations, men reported a higher 
incidence of arousal than women. It 
is difficult to assess how many of 
these reported differences reflect a 
greater female sensitivity to our re- 
strictive codes on such matters, à 
sensitivity which may have survived 
the two-hour interview with the Kin- 
sey staff. For example, consider the 
four out of thirty-three sets of data 
in which the sex differences either dis- 
appeared or were inverted. A higher 
incidence of women than men re- 
ported arousal from observing moving 
pictures. The incidence of arousal 
was about the same for men and 
women when observing their own 
sex, reading romantic literature, am 
when being bitten. These questions 
appear to the reviewer to be either 
less loaded for women, or are equally 
or more heavily loaded for men. For 
example, to report arousal from 09- 
serving a nude of the same sex imr 
plies an admission of homosexu 
feelings which would be equally un- 
desirable for both sexes. 

Let us assume that the reports are 
uncontaminated by socially deter- 
mined sex differences in what is aC 


table sexual behavior and treat 
as valid data on sex differences 
ousal. How do the authors 
in the difference? For them to 
ite that the male is conditioned by 
> experience more frequently 
an the female is no more than a 
ification of their findings. We need 
now why they condition more fre- 
ently. Is it due to a socially more 
ant attitude to the male when 
ks such experience? Is he more 
sely interested in these experi- 
8? Does his greater aggression 
1 to a more active expression and 
atification of sexual interests? The 
nation of Kinsey and his associ- 
does not become obvious until 
discuss neural factors in sexual 
nse where the following state- 
Nt appears: 
ce there are differences in the capacities 
emales and males to be conditioned by 
Sexual experience we might expect simi- 
ferences in the capacities of females and 
to be conditioned by other, non-sexual 
of experience. On this point, however, 
not yet have information (p. 712). 


EP. 

"Apparently, from the cited state- 
it, the authors believe that the 
her frequency of human male 
al conditioning is due to his 
er "capacity" to establish an 
jation between substitute and 
lable sexual stimuli, and that this 
pacity might extend to other types 
reliable stimuli as well.’ This ex- 
nation (stated as a finding) seems 
tched. It assumes that the male 
! a greater capacity for associating 
lass of substitute stimuli which 
ire not peculiar to sex (since almost 
y stimulus can be associated with 
al response). There is no evi- 
nce for an intrinsic superiority of 
€ male in associating in any field. 
There are more plausible explana- 


"Lam assuming that conditioning is a form 
ciation. 
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tions. The social code for the human 
female may be more restrictive, she 
may fear pregnancy, she may not 
feel the need for intercourse as often 
as the male, the male may be more 
aggressive—to mention a few. Kin- 
sey and his associates mention these 
possibilities in one connection or an- 
other, but they reject them in favor 
of their "capacity" theory. 

In discussing the locus of the neural 
mechanisms of sexual response, the 
authors point out that components of 
the response pattern, tumescence and 
ejaculation, are possible without sen- 
sation in paraplegics. The organism 
can thus function at the reflex level 
via the spinal cord without cerebral 
intervention. But while the sympa- 
thetic and parasympathetic nervous 
systems are also involved in the nor- 
mal situation, there is no definitive 
evidence concerning the role of the 
hypothalamus. 

The evidence on the role of the 
cortex is contradictory. On the one 
hand, a study on prefrontal lobotomy 
patients 3.3 years before, and another 
3.7 years after psychosurgery, shows 
no appreciable deviation in orgasm 
frequency from corresponding normal 
age and sex groups. On the other 
hand, the authors refer to data which 
show that damage to the cerebrum 
as a whole, and particularly to the 
cortex, may reduce an animal's 
capacity to react to psychosexual 
stimuli. The reduction is directly 
proportional to the extent of damage 
to the cortex. In spite of the contra- 
dictory evidence, they conclude that 
"although the data on the relation 
of the cortex to sexual behavior are 
limited, they do show that this is the 
part of the nervous system through 
which psychosexual stimuli are medi- 
ated" (p. 712). 

There are few performances in- 
deed which cannot be credited, with 
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equal justification, to the cerebral 
cortex. The authors find support for 
their position from the demonstra- 
tion that the male rat's copulatory 
behavior (and that of other animals) 
is more dependent on an intact cor- 
tex than that of the female. His 
performance requires more direction, 
coordination, and initiative. These 
differences make the male's copula- 
tory behavior more vulnerable to 
cortical damage, but it does not fol- 
low, even in the rat, that cortical 
differences are the cause of their sex 
differences. 

The final chapter is more carefully 
written. In it the authors address 
themselves to the problems of sex 
differences in the effects of aging on 
frequency of orgasm, and whether or 
not these differences have hormone 
correlates. "Table 1 condenses and 
reflects the data they provide on age 
trends. 


TABLE 1 
WEEKLY FREQUENCY OF ORGASM FROM 
ALL CAUSES 

Age 15 Age 50 
Single male 2.3 tA 
Single female 4 A 

Age 20 Age 50 
Married male 3.2 1.3 
Married female — 2.2 .8 


n OH 


What hormone changes correlate 
with these differences? Data on hor- 
mone assays were not obtained from 
Kinsey respondents, but from studies 
reported in the literature. Estrogen 
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and androgen levels do not correlate 
with the Kinsey data. The changes of 
17-ketosteroid levels with age show 
a closer relationship, but there are 
marked deviations in the age range 
from 15 to 27, making a simple causal 
relationship untenable. 

In addition to the hormones men- 
tioned, they consider the effects of 
hormones from the pituitary, the 
thyroid, and others from the adrenals. 
While the latter may affect sexual 
behavior, it is largely as a result of 
their influence on the general meta- 
bolic level. 

They conclude that no hormone 
affects sexual preferences, interests, 
or techniques in a selective way, but 
may rather modify the general level 
of sexual activity. 

To sum up, the conception of the 
nature of sex differences which Kin- 
sey and his associates have elaborated 
is at best awkwardly stated and mis- 
leading in many particulars. It would 
be wise to distinguish sharply be- 
tween the valuable data that they 
collected and their interpretation of 
sex differences. While the authors 
acknowledge repeatedly the impor- 
tance of psychological factors in 
understanding sex differences in or- 
gasm frequency, the constructs that 
they have made central to their pres- 
entation are sexual capacity and con- 
ditioning capacity. Since they think 
of capacity as something devoid of 
experience, we are left with the ques- 
tion, "What do they mean by psy- 
chological?” 


- psychoanalysis, its 
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Jones, Ernest. The life and work of 
Sigmund Freud. Vol. I. The forma- 
live years and the great discoveries: 
1856-1900. New York: Basic 
Books, 1953. Pp. xv+428. $6.75. 


This volume is a thorough, careful, 
and erudite account of the origin of 
invention and 


- design by the greatest of psycho- 


analysts, and written by the only 
member of his privy council still 


_ living. The choice of this biographer 


was not only fortunate but almost 


essential, for he is the one remaining 


member of those six whom Freud 
chose after the first world war to keep 
alive his radical new orthodoxy. 
Twice Freud had destroyed his let- 


i ters, diaries, manuscripts, and notes 


in order to preserve his personal 
privacy, once with a chuckle as to 
how much work he was making for 
his biographer (p. xii). But others 
had kept letters; the elaborate cor- 
Tespondence with Fliess (1887-1902) 
Was available, and the Freud family, 
àt first disposed to protect Freud's 
privacy, finally put everything at 
Es discretion, because they 
E that truth was to be pre- 
b €d to the false legends that were 
owing up about this controversial 
E Jones was thorough, as 263 
otnotes and 664 citations of sources 
attest, 
only volume describes, of course, 
TRA e immature Freud, the in- 
Eis motionel, insecure, frustrated, 
E xe impractical Freud, the 
Bs etes with making his 
es n his profession and in life and, 
$ proved ultimately, in history, 
marry aid Freud too poor to 
ordin is fiancée and for a time sub- 
ating other ambitions to achieve 


that end. This was the Freud of 
1880-1900. Six-sevenths of the 
volume treats of these two decades. 
Two other volumes are promised for 
the four decades that remain. So 
often Jones remarks on the contrast 
between this hard-driving, insecure, 
uncertain man and the wise, assured, 
tolerant, understanding savant who 
was the only Freud his disciples ever 
knew. This first book, however, does 
not show how Freud wrought his own 
maturity by self-analysis from 1897 
on. It discusses the analysis and lets 
the results remain to be seen in later 
volumes, leaving the life far short of 
culmination and this review merely a 
comment on an initial achievement 
whose meaning and value in 1900 
were to be assessed by events that 
were yet to come. 

It is not easy to perceive the whole 
Freud for any one year within these 
two intense decades. The biographer 
has chosen to devote separate chap- 
ters to different themes with greatly 
overlapping periods. Six chapters, 
for instance, deal respectively with 
six more or less synchronous histories. 

1. First there is Freud’s brief medi- 
cal career, following his unduly pro- 
longed study as a medical student, 
the period in which he acquired am- 
bition (1882-1885). Actually it was 
his intense desire to marry that 
awoke professional aspiration in him, 
but Freud's relation to his fiancée is 
left for a later chapter. 

2. Then there is Freud's discovery 
of the medical properties of cocaine 
and its value for internal use as an 
analgesic and euphoric (1884-1887). 
Incorrectly Freud, filled with en- 
thusiasm for his new finding, ad- 
vertised cocaine as not habit form- 
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ing, and lived to regret this ill con- 
sequence of his impetuosity. He 
missed, however, making the greatest 
discovery, the use of cocaine exter- 
nally by injectionasa local anesthetic. 
He had handed the preliminary in- 
dications of this application over to 
someone else, who presently and 
quite properly got the credit. Freud 
seems to have been blocked in push- 
ing this line of research, perhaps be- 
cause he had no taste for manipula- 
tory experimentation or perhaps 
because his desire to see his fiancée 
after a long absence won out over his 
staying in Vienna to prosecute these 
obvious experiments that would, as 
it turned out, have advanced his 
career. 

3. Another chapter in the book 
deals with Freud's betrothal, per- 
sistently frustrated by his poverty 
(1882-1886). His fiancée could ex- 
pect no sizable dowry, and Freud 
seems at the first to have been driven 
to professional work in order to get 
enough money to marry. Again and 
again indigence prevented him from 
visiting his betrothed or being able 
to give her presents. When you read 
of these years in this chapter, you 
could think that wishing and plan- 
ning for marriage and writing love 
letters took up all of Freud's time, 
but you would be wrong. He was 
pursuing half a dozen other activities 
simultaneously. 

4. The chapter on Freud's mar- 
riage (1886) shows how need reduc- 
tion quickly brought satisfaction, and 
thereafter parenthood (six children) 
and a ready lifelong monogamy. One 
cannot help believing that Freud 
was strengthened in support of his 
own theory of sexuality and in with- 
standing the odium sexuale that was 
directed toward him, because his own 
sexual desires were essentially con- 
ventional, both as to monogamy and 
as to male dominance. 
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5. The chapter on the neurological 
period depicts Freud at first still | 
convinced that histology and mor- 
phology are the best biological disci- 
plines, the most scientific, a view due 
in part to the influence first of Brücke 
upon Freud, and later of the his- 
tologist Meynert (1883-1897). Freud 
was by early preference a physicalist, 
He preferred organic to functional 
explanation and disliked what was 
in those days called psychology. 
Brücke was one of the four pupils of 
Johannes Müller who had declared 
themselves vehemently against vital- 
ism in the 1840's.  Helmholtz was 
another, and Jones throughout this . 
book speaks of the influence of the : 
“Helmholtz school" as directing 
Freud away from the vagaries of 
psychology towz: the hard facts 
of physiology and the harder ones of 
anatomy. Anatomy, however, did 
not work out then any better than 
now to provide an understanding of 
human thought and conduct, and 
Freud's writing of his volume on 
aphasia in 1891, after he had been - 
exposed to the magic of Charcot in 
1885 and had transferred his loyalty 
from Meynert to Charcot, witn 
his acceptance of functional explana- 
tion when organic factors could not 
be found. Later, of course, he turn 
to psychology, still recognizing the 
values of the Helmholtz school by 
characterizing the new psychological 
principles as ‘‘mechanisms.” 

6. Then there is Freud's long and 
intimate association, both persona 
and scientific, with Breuer, fourteen 
years his senior. Breuer believed in 
Freud. He lent him money that was — 
seldom repaid. Both of them E 
investing Breuer's money, Freud : 
lieved, in the promising enterprise © 
Freud's thought and research. pi 
first the two men were associated fi 
work, and it was Breuer who dis 
covered the cathartic method 4^ 


- had become available. 
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thus in a way began psychoanalysis. 
Later Breuer, frightened by the phe- 
nomenon of transference in a woman 
patient, escaped from the field, leav- 
ing Freud, who had greater persist- 
ence and no sense of guilt when pa- 
tients fell in love with him, to carry 
on. Much later, when Fliess had re- 
placed Breuer as the father-image in 
Freud's life, Freud turned to bitter 
criticism of Breuer in his letters, but 
he never attacked him in print. 

So there you are. If you pick out 
the year 1885, for instance, you find 
Freud eating his heart out to get 
married, yet with time and energy 
left for hospital work, for his cocaine 
research, for his study of the neurol- 
ogy of the medulla, for his work with 
Breuer after the cathartic method 
Any other 
year is equally full. Jones, it seems to 
me, fails to make you feel the inten- 
sity and range of Freud's avidity; 
yet you can figure it out for yourself 
if you synthesize the chapters. 

The immature Freud needed a 
father-image, someone to whom he 
could defer and whose approval and 
support he could win. He had three 
in succession during his professional 
life: Brücke, Breuer, and Fliess. 
After that his self-analysis rendered 
him strong enough to get along with 
disciples only. 

It was from Brücke, rigorous physi- 
ologist and neurologist, that Freud 
got the faith in neurological determin- 
ism which later in the Traumdeutung 
of 1900 he turned into psychic deter- 
Minism, 

Breuer was the faithful admiring 
friend and collaborator. He gave 
Freud much and required little. He 
could not, however, bring himself to 
Continue collaboration, and later, 
as Freud’s theory of sexuality began 
to dominate his professional thought, 


à coolness developed between the 
two, 
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After Breuer, Fliess! Nowadays we 
should call Fliess a brilliant numero- 
logical crank. He believed passion- 
ately that all living matter, even the 
individual cell, is bisexual, that living 
nature is based on periodicity, and 
that the basic periods are 28 (the 
female period, the menstrual cycle) 
and 23 (the male period, the interval 
between menstrual cycles). Freud, 
no mathematician, was captivated 
by Fliess's ability to use mathe- 
matics to compute conclusions out 
of these basic numbers, but actually 
what the two gave each other was 
mutual admiration and the self-confi- 
dence that goes with appreciation and 
the appearance of understanding. 
They wrote to each other at least 
once a week, criticized each other's 
manuscripts, got together in impor- 
tant special meetings (they called 
them "'congresses") whenever they 
could. Freud played the dependent 
role, and, when the break came later, 
Fliess initiated it. This was an in- 
tense 15-year friendship (1887-1902), 
which illustrates well the operation 
of human needs but does little to 
dignify, the character of Freud. No 
wonder he destroyed his half of this 
correspondence aíter his self-analysis. 
Freud's to Fliess were saved.* 

You have to do a little guessing and 
a little reasoning to find out how 
Freud worked and what it was he 
called work. He made “discoveries,” 
and the subtitle of the book is “The 
Formative Years and the Great Dis- 
coveries.”’ You know that he worked; 
that he worked hard; that he worked 
best when he did not feel well; that 
the intensity of his work often wore 
him out; that his successes were 
capricious, for there were long times 


1 And have been published since this re- 
view was written. FREUD, SIGMUND. The ori- 
gins of psychoanalysis: Sigmund Freud's letters 
to Wilhelm Fliess New York: Basic Books, 
1954. $6.75. : 
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when no insight would come and then 
suddenly a particular problem would 
find itself resolved. He seems ir- 
regularly to have had these insights, 
then to have formulated hypotheses, 
and then to have adjusted and re- 
shaped them as he kept applying 
them to his patients in analysis and 
in former analyses. He seems almost 
never to have accepted a new hy- 
pothesis quickly, but only slowly and, 
as it were, reluctantly, after much 
checking. Nor did his accepted hy- 
potheses always stay right. There 
was Freud's belief about adult hys- 
terical symptoms’ always having 
their roots in infantile sexual experi- 
ence. What at first seemed to con- 
firm this belief was the constant re- 
port in analyses of incestuous rela- 
tions between parents and children. 
Freud, as well as Breuer, had diffi- 
culty in accepting this finding as a 
fact, and then suddenly in 1897 
Freud realized that these attempted 
seductions were nothing more than 
patients' fantasies that had never 
really occurred. That “discovery” 
was a blow to Freud, for it seemed to 
knock out half of the structure of 
psychoanalysis, leaving intact only 
the theory of dreams as wish-fulfill- 
ment. Actually, as it turned out, the 
"discovery" strengthened the theory 
of infantile sexuality, weakening only 
the false belief in parental sexuality. 

Although Freud usually was slow 
to accept these insights, once in a 
while an insight was sudden enough 
to be dated. One such instance is 
indicated by Freud's suggestion, in a 
moment of gaiety, that there should 
be erected a marble tablet at the 
table near the northeast corner of the 
terrace of Bellevue Restaurant in 
Vienna. The tablet was to read: 
"Here the secret of dreams was 
revealed to Dr. Sigm. Freud on July 
24, 1895” (p. 354). 
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Freud's technic was not, of course, 
the experimental method. He him- 
self remarked in 1900: "I am not 
really a man of science, not an ob- 
server, not an experimenter, and not 
athinker. I am nothing but by tem- 
perament a congquistador—an ad- 
venturer" (p. 348). And at another 
time: “It seems to be my fate to dis- 
cover only the obvious: that children 
have sexual feelings, which every 
nursemaid knows; and that night 
dreams are just as much wish-fulfill- 
ment as day dreams" (p. 350). On 
the other hand, one should not take 
Freud's modesty and momentary 
self-depreciation too literally. He 
thought of himself as a discoverer, 
and those insights that stayed valid 
were the result of long and laborious 
thinking within the monotony of 
psychoanalyses and their repeated 
review. Freud’s technic lay some- 
where between that of the experi- 
mental psychologist, who alters an 
independent variable and observes 
the result, and the philosopher-psy- 
chologist, who induces generaliza- 
tions about human nature from the 
reservoir of his experience. Freud 
made his generalizations from a 
wealth of specialized experience and 
then tested his hypotheses out against 
particular cases, increasing his as- 
surance about the validity of each 
induction as the number of consistent 
cases grew. He had, however, no 
control, either in the sense of the 
rigorous constraint of contributing 
factors or in the sense of adding the 
method of difference to the method of 
agreement. Indeed he seems to e 
been restricted to Mill's method o 
agreement, pure and simple, a meth- 
od which by itself is clearly unsafe. , 

Was Freud original? He fitted his 
Zeitgeist, of course, as all great men 
do. Brücke influenced him in T€ 
pect of determinism, and Meynert n 


BOOK REVIEWS 


respect of active ideas and their re- 
ion, for Meynert was an admirer 

of Herbart, who in these matters is 
clearly an intellectual ancestor of 
Freud's. Jones mentions as also in- 
fluential Fechner and Griesinger, both 
of whom derived many of their 
‘deas from Herbart. Breuer's influ- 
ence we have already noted. Charcot 
captured Freud’s admiration when he 
visited him in 1885 and later Bern- 
heim became a lesser influence. Char- 
cot it was who led Freud to shift his 
allegiance from the organic to the 
functional and thus away from Mey- 
nert. Jones does not find that Bren- 
tano influenced Freud, although 
Freud knew him, sat under him in 
lectures, translated a volume of J. S. 
Mill at his recommendation. There 
is no relation between Janet and 
Freud except that both were affected 
by the same Zeitgeist and by Charcot. 
„Yet all this fitting of Freud into 

his intellectual genetics is not very 
important. Certainly Freud was 
vastly original. His conceptions of 
infantile sexuality and of the psy- 
chological mechanisms are not to be 
found in Charcot or elsewhere, and 
the importance of his originality is to 
be measured by its effects. In this 
Teviewer’s judgment Freud is a great 
man because he influenced thought 
and civilization more than any other 
Person whom psychology might claim 
in the last three hundred years. A 
Tolstoy might have shown him as 
More the agent of an irresistible 
Zeitgeist than as a great originator, 
and still he would be a Great Man, 
for that is what Great Men are. 
Kings are History’s slaves. That the 
“ings who have commanded the 
Choices progress should make are 
singled out by posterity to be called 
great is the proof of man's essential 
insecurity and his need to find safety 
1n pride and reverence for others than 
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himself. Freud would have recog- 
nized such a fact without losing his 
faith in Helmholtzian determinism; 
he would have believed that greatness 
is not diminished by its having 
causes. 
EpwiN G. BORING. 
Harvard University. 


Frovp, W. F., & WELFORD, A. T. 
(Eds. Symposium on fatigue. 
London: H. K. Lewis & Co., 1953. 
Pp. vii4-196. 24s. 


This report of both English and 
American workers is concerned with 
various aspects of inadequate per- 
formance of the human, and tech- 
niques for detecting and measuring 
them. Both old and new slants are 
represented. While the symposium 
is labeled as one on fatigue, no verbal- 
ized convention is agreed upon as to 
what is fatigue and what is to be 
otherwise defined. In some cases, 
the participants admit that they are 
not really reporting on fatigue but 
rather upon decreasing performance 
(R. C. Browne). Some tacitly imply 
that fatigue is performance decre- 
ment (R. M. Gagné), some explicitly 
state it (W. T. Singleton). A. T. 
Welford explicitly recognizes that 
there are several distinctly different 
phenomena called fatigue. One is a 
subjective state related to some kind 
of physical or mental strain. The 
other is a decrement in performance 
following more or less prolonged ac- 
tivity. He states that the psycholo- 
gist takes a position “between the 
two.” This is to say that the psychol- 
ogist must “keep in touch with the 
man in the street"; he must “build 
his theories in such a way that they 
can contain the experiences and be- 
havior of ordinary people." Welford 
further states that “it is to physi- 
ology that he must look for explana- 
tions." 
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Others try to connect fatigue with 
learning and drives as defined in 
prevalent learning theory (D. R. 
Davis). It seems as though learning 
deserves psychological theorizing but 
that fatigue does not. T. A. Ryan 
deals very ably with the factor of 
effort in performance and thus comes 
closest to saying that fatigue is a 
proper study for psychologists along 
side other phenomena such as anxi- 
ety. Though he did not, Ryan might 
have said that since he found effort 
definable only in experiential terms, 
the sine qua non of fatigue is an identi- 
fiable experience, and at the same 
time a respectable object of scientific 
inquiry. 

Nowhere throughout the printed re- 
port of the symposium does there 
seem to be a recognition that the 
topic of fatigue is in an undefined and 
therefore unusable state for intelli- 
gent discussion and inquiry. There 
was no recognition of the logically 
intolerable condition of having a 
number of terms used synonymously 
in technical discourse; e.g., fatigue, 
boredom, tiredness, work decrement, 
disorganization of performance, and 
impairment. Because of this defi- 
ciency and what follows from it, all 
these very careful and ingenious 
Studies lack the orientation and 
meaningfulness that they would 
otherwise have. 

It is to be hoped that the need to 
systematize knowledge and informa- 
tion, and to orient ourselves by sys- 
tematic thinking, will soon be recog- 
nized to be as urgently required as 
the continued collection of more data. 
To close with a positive note, it can 
be said that the reader will find in 
the symposium a number of ingenious 
experiments that are well worth his 
acquaintance. 

S. HOWARD BARTLEY. 

Michigan State College. 
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HovLAND, CARL I., JANIS, IRVING L., 
AND KELLEY, HAROLD H. Com- 
munication and persuasion. New 
Haven: Yale Univer. Press, 1953, 
Pp. xii4-315. $4.50. 


This volume brings together the 
findings of the Yale Communication 
Research Program, a program de- 
voted to the study of the modification 
of attitudes and opinions through 
communication by means of con- 
trolled experiments. The project was 
set up as a cooperative research and 
study group rather than in a central- 
ized, hierarchically organized form. 
Approximately 30 individuals con- 
tributed to the work, each being en- 
couraged to pursue his own interests 
and theoretical bent. 

Ideal as this plan may be from 
many points of view, it clearly has 
made the job of reporting a difficult 
one. The researches are uneven in 
quality, and stem from a variety of 
theoretical approaches, including, 
among others, Hull's learning theory, 
some motivational hypotheses of 
Freud and other psychoanalysts, and 
some of the formulations of Lewin, 
Sherif, Newcomb, and others con- 
cerning the effects of group member- 
ship. The authors of Communication 
and Persuasion, therefore, adopted a 
structural organization, using topical 
headings of (a) the communicator, 
(b) content of the communication, 
(c) audience predispositions, includ- 
ing group conformity motives and 
individual personality factors, and ( 
responses, including overt expression 
of opinion and retention of opinion 
change. In the light of the formidable 
organizational task they faced, the 
authors are to be congratulated on the 
book's structural unity, ease of tram 
sition, and readability. | 

The experiments themselves dea 
almost exclusively with the effects 9 
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one-way communication on “cap- 
tive" audiences, opinion change and 
retention being assessed by means of 
questionnaires. The resulting limi- 
tations are obvious, but since most of 
the questionnaires were carefully 
worked out with numerous cross 
checks, and since all the experiments 
are primarily directed toward basic 
theoretical rather than immediately 
practical aims, it seems likely that 
many of the generalizations will hold 
for more complex communication 
situations evaluated in other ways. 
The probability of this being so is 
increased by the fact that the authors 
have made a commendably thorough 
review of the relevant literature, 
which is presented for each experi- 
ment or set of experiments as the 
context within which the new find- 
ings are considered. In addition, 
most of the experimenters have made 
admirable use of advanced experi- 
mental designs which allow the assess- 
ment of numerous variables at once, 
both singly and in interaction. 

In summary, this book is an ex- 
cellent report of studies that make 
à substantial contribution to an im- 
portant area of social psychology. It 
755 a source of many experimentally 
derived, thought-provoking general- 
izations concerning the processes of 
Communication and opinion change. 

hese generalizations, however, tend 
to be quite limited, specific, and ten- 
tative, thus reflecting the present 
State of knowledge in the field, the 
over-all eclectic nature of the research 
Teported, and the authors' preference 
for sticking close to the available 

ata. The volume also provides an 
€xcellent review and analysis of a 
arge area of communication litera- 
ture, as well as a number of valuable 
Suggestions for future research. 

r F. P. KILPATRICK 

Princeton University. 
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BINGHAM, WALTER VANDYKE. Homo 
sapiens auduboniensis. New York: 
National Audubon Society, 1953. 
Pp. 39. $1.10. 


A significant, if not the most con- 
spicuous, phase of the late Walter 
Bingham’s qualities receives a side- 
light from the address “Homo Sapi- 
ens Auduboniensis," which he gave 
in October 1937 at a dinner of the 
Audubon Camp held at the American 
Museum of Natural History. The 
address gives its title to a recent 
memorial volume of which it is the 
nucleus. 

Following a brief, touching fore- 
word in the name of the Audubon 
Society, there is a memoir by Walter 
Bingham's wife, Millicent Todd Bing- 
ham, which embodies the major por- 
tion of the 39-page text. Its title, 
“Beyond Psychology,” is not invidi- 
ous to our discipline, meaning rather, 
“What should they know of Bingham 
who only Bingham knew” for his 
professional accomplishment, hav- 
ing but fragmentary perception of his 
various reachings into the humanities 
and the arts, particularly music; not 
to say the depth and breadth of good 
will he was so apt in implementing, 
his steady cheeriness, his hard prac- 
tical sense, his effectiveness in physi- 
cal emergency. 

Then come the five pages of the 
“Homo Sapiens Auduboniensis" it- 
self. The pages that one of our group 
would least want to miss are the two 
at the volume's close, a (1948) letter 
of Bingham's to the able and ener- 
getic director of the Audubon Camp, 
Carl Buchheister. It concerns the 
lag of social behind material culture 
and something to do about it; com- 
pare also the note on an earlier letter 
to Clark Wissler (pp. 26-27). The 
address itself, as implied in its title, 
is mainly a whimsical characteriza- 


440 


tion of Audubon Camp participants, 
phrased in natural history terms. For 
Bingham's professional colleagues, 
this theme has perhaps less intrinsic 
interest than other portions of the 
volume, though towards the close he 
emerges from the role-taking more 
fully. The remarks have a good deal 
more interest for the psychographer, 
for the part that humor played in 
Bingham's personality, a sort of test- 
ing the limits of facetiousness that 
he would allow himself. 

What was the role in Bingham's 
personality of the "humor" that Mrs. 
Bingham simply mentions (p. 15) 
after rather discounting it on the 
preceding page? In humor's ethically 
highest reach,! Bingham was at home 
with the best minds. In the other 
levels, more connoted by “sense of 
humor," he was progressively less at 
home. Fun-loving would hardly de- 
scribe him, though he was ready 
enough that others should be so. In 
the stereotype of the salon he would 
have passed for a relatively humorless 
man. Still less was he one to deal in 
the Joke Proper; and Flippancy 
tended to disgust him. At any level, 
the humor of “Wit” was not Bing- 
ham's cup of tea; that of Joy em- 
phatically was. 


1*5, , . Joy, Fun, the Joke Proper, and Flip- 
pancy. You will see the first among friends 
and lovers reunited on the eve of a holiday. 
++. Something like it is expressed in much of 
that detestable art which the humans call 
Music. . . ." (C. S. Lewis, The Screwtape Let- 
ters, p. 57). 
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The ego-defense dynamic of wit 
seems well established, though it 
matters much what it is a defense 
against; compare Abraham Lincoln 
with Jonathan Swift. In a character 
like Bingham there is little need for 
such defense, and it does not develop 
readily or far. To a whimsicality 
like "Homo Sapiens Auduboniensis" 
he could bring himself; satire would 
have been mostly beyond him, or 
beneath him, as you please. Try to 
imagine Walter Bingham composing 
a Lilliput, let alone a Modest Proposal. 

Bingham's professional colleagues 
may well thank the National Audu- 
bon Society for their graceful tribute, 
and for the light it throws on this 
comparatively unconsidered facet of 
Bingham's rich personality. Beyond 
this, of course, looms the whole 
question of the role of nature study in 
the benign socialization of the indi- 
vidual. It can bea negativeone. How 
much did love of nature help to 
make Bingham what he was? How 
much did Bingham, being what he 
was, have to respond to nature in 
this way? We are not likely to know. 
What we do know is that this love 
of nature had no small part in one of 
the best and wisest men that ever 
made psychology his profession; that 
he had a lively and steadfast faith 
in its benign influence on others, and 
in its communicability. 


FnED L. WELLS. 
Newton Highlands, Massachusetts. 
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This paper presents a survey of 
Japanese psychology, first, from a 
historical point of view, and second, 
from the point of view of describing 
some of its modern aspects. The occa- 
sion for organizing the paper arose in 
the summer of 1952 when the two 
authors were members of the Seminar 
in Experimental Psychology, one of 
the Kyoto Seminars in American 
Studies conducted under the auspices 
of Kyoto University, Doshisha Uni- 
versity, the University of Illinois, and 
the Rockefeller Foundation. Many of 
the ideas expressed and information 
contained in the account are products 
of the many discussions that went on 
among the Seminar participants 
along with the formal academic ac- 
tivities,! 

4 The present survey attempts, with- 
in its limitations, to trace, in general 
p» and without claim to ex- 
Corin certain historical pat- 

™s of Japanese psychology and 
is emergence into the modern 

The m of fact, theory, and practice. 

"i E enumerates many re- 
Tches that have been carried out 
A Y Japanese investigators, researches 
the x not usually well known in 
dr E It is hoped that the pres- 
est in rn will stimulate inter- 
ese contributions. 
! We are greatly i 

Fukuhara of Dosiha Geen, Rene 
on Whig ws help in accumulating materials 

ich this report is based. 


History or PSYCHOLOGY IN JAPAN 
Psychology until 1926 


Western psychology was intro- 
duced in Japan about 10 years after 
the beginning of the Meiji Era (1868- 
1912) against an ancient background 
of Indian, Buddhist, and Chinese 
philosophy. Once the introduction 
was effected, translations were 
quickly made of books by Haven, 
Bain, Sully, Wundt, and Ladd: The 
first experimental researches of Japa- 
nese psychologists were performed in 
America and Germany in the last 
decade of the nineteenth century by 
such men as Motora (111), Matsu- 
moto (87, 88, 89, 90, 91), Nakajima 
(116), T. Okabe (142), and Kakise 
(43). 

Yujiro Motora (1858-1912) and 
Matataro Matsumoto (1865-1943) 
were the principal pioneers of Japa- 
nese psychology. Motora was the 
first professor of psychology in Tokyo 
University. He had a philosophic 
mind and tried to establish a system 
of psychology. Matsumoto was pri- 
marily an experimentalist. He de- 
signed the psychological laboratory 
of Tokyo University in 1903, after 
inspecting laboratories in America 
and Germany, and established a 
laboratory at Kyoto University in 
1907. He became the professor of 
psychology at Kyoto in 1906 and 
succeeded Motora at Tokyo after 
the latter’s death in 1913. He advo- 
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cated the study of “psychocine- 
matics," a science of psychophysio- 
logical behavior, and became the 
father of applied experimental psy- 
chology in Japan. He survived 
Motora by 31 years. Most of the 
older leaders of Japanese psychology 
are his students. 

The first 25 years of the twentieth 
century constituted an orientation 
period in the methods of psychologi- 
cal research. Before 1925 studies ap- 
peared under the names of such re- 
searchers as Chiba (16, 17), K. Ma- 
suda (83, 84, 85), Yh. Kubo (64, 65, 
66), K. Tanaka (183, 184, 185), G. 
Kuroda (70, 71), Sakuma (154, 155, 
156, 157), Koga (58), Chiwa (18), and 
Narasaki (119). In 1911, Ohtsuki 
(138) published his Experimental 
Psychology, a book of 1,000 pages, 
which owes its material to Wundt, 
Titchener, and other German and 
American psychologists. The preva- 
lent schools of psychology in Japan 
at this time were Wundt's appercep- 
tion psychology and American func- 
tionalism. Students of structuralism, 
such as Yokoyama (218, 219) and 
Takagi (175), came at the end of this 
period, as did Yatabe, who had just 
returned from Piéron’s laboratory. 

Experimental studies had started 
in the psychological laboratories of 
Tokyo and Kyoto universities near 
the beginning of the century. These 
works were first published as mono- 
graphs, and it was not until the Japa- 
nese Journal of Psychology was 
founded in 1919 at Kyoto by Genji 
Kuroda that a ready means of publi- 
cation became available. The edi- 
torial office was moved to Tokyo in 
1923. 

With the advent of better economic 
conditions after World War I the 
number of educational institutions in- 
creased greatly. Departments of psy- 
chology were established in Tohoku 
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(Sendai), Kyushu (Fukuoka), Keijo 
(Seoul, Korea), Taihoku (Taipei, 
Formosa), Nihon, Hosei, Waseda, 
Keio (the latter four in Tokyo), 
Doshisha (Kyoto), Kwansei Gakuin 
University (Kobe), and the Tokyo 
and Hiroshima Colleges of Arts and 
Science. In addition, many psycholo- 
gists obtained positions in the Koto- 
gakko (essentially junior colleges). 

It is not surprising that the increase 
in Japanese educational facilities was 
accompanied by a quickened interest 
in educational psychology, an inter- 
est that was becoming strongly estab- 
lished by World War I. The mental 
test movement found a home in Ja- 
pan, but many of its technical aspects 
were not developed along the lines 
that prevailed in America. 

Studies of personality did not be- 
come important in number until the 
1930’s, but it is important to note 
that Watanabe wrote a book on per- 
sonality (202) as early as 1912. In 
dustrial psychology was recognized 
by some people as an important field 
of research during World War I, but, | 
probably due to problems peculiar to 
Japan because of overpopulation, it — 
did not develop (nor has it yet de- 
veloped) to the same important posi- 
tion it holds in the United States. 
In clinical psychology, the psychi- 
atric institutes of Tokyo and Kyoto 
universities participated in the men- 
tal test movement and devised psy- 
chodiagnostic methods, but only à 
few individual psychologists, € 
Oguma (133), took an interest !n 
problems of abnormal psychology. 
Kuwata (75) of Tokyo University 
introduced the folk psychology of 
Wundt, and Iritani (36) the soci 
psychology of McDougall. 

In general, it may be said tha 
spite the existence of certain 
tered interests in other areas, psycho i 
ogy in Japan until 1926 was largely 


t de- 
scat- 


PSYCHOLOGY IN JAPAN 


dominated by educational psychology 
and classical experimental psychol- 
ogy. 

One major contribution of great in- 
fluence that was to have an impor- 
tant bearing on later developments in 
Japanese psychology appeared in 
1925. In this year Matsumoto wrote 
his comprehensive Psychology of Intel- 
ligence (90), a book that dealt not 
only with intelligence, but also with 
mental work, senescence, environ- 
mental factors, efficiency, and mili- 
tary psychology. In addition it in- 
cluded a broad commentary on the 
general development of psychology. 
It involved a survey of research done 
by Japanese psychologists under his 
(Matsumoto's) guidance as well as a 
consideration of related work done by 
western psychologists. It was a mile- 
stone in the development of Japanese 
applied psychology. 


From 1926 through World War II 


The beginning of the Showa Era 
(1926) marked the development of 
some important trends in Japanese 
psychology. At that time the influ- 
ence of gestalt psychology was be- 
ginning to be felt; the Japanese 
Psychological Association was es- 
tablished, as was the new series of the 
Japanese Journal of Psychology; and 
the number of graduates in psychol- 
ogy showed a remarkable increase 
due to the expanded system of higher 
ton that developed after World 

ar I. 

Sakuma (157) and Onoshima (146), 
who studied at Berlin University, 
Were the chief expositors of gestalt 
Psychology. Sakuma’s translation of 
Kéhler's Gestalt Psychology (154) is 
Well known. It can safely be said 
that gestalt theory has colored the 
thought of most Japanese psycholo- 
ia from about 1926 to World War 
I (and even now exerts a powerful 
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influence). The years around 1926 
also saw the introduction of the 
works of the Leipzig school (eg., 
those of Krueger and Volkelt) by 
Iwai (40, 41) and others (159) of 
Kyoto University. It would, how- 
ever, be wrong to think that these in- 
fluences were unaccompanied at this 
time by original Japanese contribu- 
tions. Chiba proposed a theory of 
"Eigenbewusstsein," R. Kuroda the 
psychology of “comprehension,” and 
Sakuma a theory of ‘‘basic conscious- 
ness” as directly experienced. These 
contributions may be seen as an at- 
tempt to revise the traditional theory 
of consciousness in the light of some 
characteristics of oriental culture. 
Among books that appeared near 
1926, several deserve particular men- 
tion. K. Masuda’s Introduction to 
Experimental Psychology, a manual 
for beginners, appeared in 1926. The 
same author’s Methodology of Psychol- 
ogy, published in 1934, is an author- 
itative work in general psychology. 
Yh. Kubo compiled two volumes of 
his important Handbook of Experi- 
mental Psychology during 1926 and 
1927, and Kido's General Outline of 
Psychology appeared in 1931. 


Experimental Psychology 


What were some of the evident 
trends in experimental psychology 
from 1926 to World War I1? 

Perception. In the years between 
1926 and 1939, the study of percep- 
tion progressed at a quickened pace, 
usually along lines dictated by gestalt 
psychology. A short enumeration 
of some of the studies undertaken 
may be used to indicate the areas of 
greatest interest. 

The Psychological Institute of Kyu- 
shu University published a series of 
experimental studies on the structure 
of perceptual space directed by 
Sakuma, Yatabe, and Akishige (8, 9, 
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95, 158, 182, 206). Takagi (176) 
compared the structural, phenome- 
nological, and gestalt views of per- 
ception. Obonai (30, 121, 125, 126, 
127) continued (and has to the pres- 
ent time) his studies of psycho- 
physical induction in perception, 
memory, and other areas. (Most of 
the contributions are in the area of 
visual perception, where Obonai has 
tried to formulate laws of induction 
as extensions of the laws of contrast 
and confluence.) 

Matsubayashi (86), an ophthal- 
mologist, performed his classical ex- 
periments on depth in 1937 and 1938. 
A number of investigations of the 
visual constancies were carried out. 
Akishige (8), Ibukiyama (27), and 
others worked on size constancy; 
Yt. Kubo (67) on shape constancy; 
and Ogasawara (130) on the con- 
stancy of phenomenal velocity. Nishi 
(121) studied the moon illusion. 

Studies on the perception of move- 
ment may be represented by the in- 
vestigations of Mukuno (113) on 
tactile apparent movement, S. Kubo 
(63) on visual apparent movement, 
Hisata (27) on auditory apparent 
movement, Fukutomi (21) on delta 
movement, and Ogawa (131) on the 
seen path of real movement. Studies 
on form perception were carried out 
by Morinaga (102), Hayami and 
Miya (25), Yagi (203), and others. 
Studies of the relation between time 
and space (the “tau effect" and its 
reverse) were performed by Abbe 
(1, 2) and by Suto (172). 

At this time Y. Wada (200) per- 
formed experiments on the time error 
for auditory stimuli. Other studies 
in audition were not extensive, but 
Yuki made an important contribu- 
tion when he published A Psychology 
of Tone (221) in 1933. In the field of 
gustatory sensitivity, Rikimaru (151) 
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made a survey oí taste bli 
among Japanese, Chinese, and For 
mosan natives. 

The area of eidetic imagery was 
investigated by Ohwaki and Masaki 
(80) and the former (139) published 
his book Eidetic Imagery of Children 
in 1935. 

The work in perception during this 
period was essentially the product of - 
a relatively small number of Japanese 
psychologists. These psychologists 
were very active and contributed 
number of results which are, unfor- 
tunately, not as wel! known in west- — 
ern countries as they should be. — 

Human learning, caimal learning, 
and thinking. The field of human 
learning was not so intensively iñ- 
vestigated from 1926 to 1939 as was. 
the area of perception. The studies 
that were done were generally per- | 
formed within the framework of 
gestalt psychology. For example, 
Ushijima (197) studied the achieve- 
ment problem (Erfolgsproblem) in 
learning, Amano (11) investigated | 
"memory traces” in the tradition of - 
the Wulf experiment, and problems 
of retroactive inhibition, proactive 
inhibition, and reproductive inhibi- - 
tion were taken up by Sagara (152, 
153) and Maeda (76). ; 

Researches in the field of animal 
behavior were sparse, but a few in- 
teresting investigations were under- 
taken. R. Kuroda published studies 
on the hearing of reptiles as early a$ 
1923. Thereafter he extended his rê- ' 
searches to the monkey, white rat; 
and tortoise. He wrote a Psychology 
of Animals (74) in 1936. Takagi | 
(178) studied the influence of back- 
ground upon the transposition of se- 
lective responses to brightness (in the ^ 
varied tit) and investigated form. 
constancy in the tomtit (177). The 
transposition of selective responses 
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was also studied by Ohtsuka (137) 
and Shirai (164). Odani (128), a psy- 
chologist and psychiatrist, studied 
the effects of cerebral lesions upon 
the visual and auditory discrimina- 
tion habits of white rats. Yamanou- 
chi, a biologist, published his Psy- 
chology of Animals in 1938. Hayashi 
(26), a student of Pavlov, pioneered 
in the study of conditioning. 

The general problem of thinking 
has been of great interest to Japanese 
psychologists. Kido (69), Mitui (96), 
and Tomeoka (189) of Hosei Uni- 
versity performed experimental ob- 
servations on the process of doubt 
and speculation. Sato (160), Abe 
(5), and Amano (12) analyzed the 
process of comparison by introspec- 
tive methods. Sato (160) has extended 
his interest in this general area to 
investigations of developmental as- 
pects of the comparison process. 


Personality 


In the thirties German character- 
ology was introduced into Japan, 
and some experiments (10) were per- 
formed within the framework of this 
Viewpoint. These experiments con- 
Stituted the first systematic work in 
the field of personality. For example, 
Psychologists (7, 187) at Waseda 

niversity performed research in the 
context of Kretschmer's typology, 
and Uchida (191), in particular, de- 
vised a continuous addition test, es- 
sentially a modification of Kraepel- 
ms work. Uchida’s test is now in 
&eneral use in Japan (190). During 
this period, Abe (6), Susukita (171), 
and others, under the guidance of 
Chiba, studied personality differences 
among Manchurian races. Masaki 
and Yoda (82) studied personality 
Tom the viewpoint of educational 
Psychology and wrote A Psychology 
of Character in 1937. 


447 


Educational and Clinical Psychology 

Educational psychology, one of the 
two oldest fields of psychological in- 
vestigation in Japan, showed consid- 
erable progress between 1926 and 
World War II. Studies of develop- 
ment, learning, personality, work, and 
fatigue increased greatly in number 
and had direct influences on educa- 
tion. 

The Institute for Child Welfare of 
Aiikukai and the Institute for Child 
Study in Nihon Women's University 
played particularly important roles in 
researches in child psychology during 
the thirties. Tokyo (with Tanaka, 
Narasaki, Terazawa, and Takemasa 
[120, 181, 183]) and Hiroshima College 
of Arts and Science (with Kubo and 
Koga [57, 64, 66]) were centers of edu- 
cational psychology. Kyoto Uni- 
versity, with Nogami (123) in ado- 
lescent psychology, and Iwai, Kato, 
Sonohara, and Moriya in child psy- 
chology, was a center of develop- 
mental psychology. At this time Yh. 
Kubo (66) wrote his Child Psychology 
under the influence of Charlotte 
Bühler, and Hatano introduced Pia- 
get's work (24) to Japan. The main 
weakness of the child studies carried 
out in this period lay in the fact that 
no provisions were made for large- 
scale follow-up investigations. Inves- 
tigations in the areas of maturity and 
old age were done almost solely by 
Tachibana (174). 

The investigation of children's per- 
sonalities was taken up in the period 
from 1920-1935. Intelligence tests 
and personality tests were not elab- 
orately developed or used in Japan 
until after Ohtomo, a student of 
Judd, wrote his two-volume Diag- 
nostics of Education (136) in the 
period 1928-1933. A revision of the 
Binet-Terman tests of intelligence by 
H. Suzuki (173), then a school in- 
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spector of Osaka City, has been in 
general use in Japan since 1930, and 
group intelligence tests were also 
adapted for Japanese use by Tanaka 
(185), Kirihara (53), Awaji (13), and 
others. The only personality tests 
developed during this period were a 
test of extroversion-introversion by 
Awaji and Okabe (14), and a revision 
of the Downey Will-Temperament 
test by Kirihara. 

In the 1930's psychologists, child 
psychologists, and educationists 
formed groups to study clinical and 
abnormal child psychology. Their 
studies mark the beginning of clinical 
psychology in Japan. Several child 
guidance clinics were established 
about 1930 (in Kyoto City, Kobe 
City, Hyogo Prefecture, Aichi Pre- 
fecture, and Tokyo City). At the 
same time a few clinics were estab- 
lished in prisons, courts, and reforma- 
tories under the Japanese Judicial 
Department and in guidance centers 
set up by the Welfare Department. 
The term clinical psychology was 
never used officially during this pe- 
riod, and the work of the few psy- 
chologists employed in the clinics was 
limited to diagnosis and the evalua- 
tion of intelligence and character. 
The work of the early clinicians and 
the work of the educational psychol- 
ogists overlapped, but their respec- 
tive efforts differed in one respect: the 
former extended their methods to 
meet the requirement of evaluating 
abnormal children and criminals. In 
a few cases the clinicians adopted 
methods typical of psychoanalysis 
(Marui [79] and others). 


Industrial Psychology 


This field, always at a disadvan- 
tage in Japan because of the ever- 
present overpopulation and its con- 
sequence, cheap labor, did not pro- 
gress as rapidly during the 1930's 
as some other areas of psychology. It 
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has been mentioned that, during 
World War I, attention was paid to 
industrial psychology and some sys- 
tematic studies of the labor problems 
were made subsequent to 1926 by 
members of the Kurashiki Institute 
of the Science of Labor, founded in 
1921 by M. Ohara. (The Institute 
moved to Tokyo in 1937 and has con- 
tinued its work until the present 
time. It has included several psy- 
chologists on its staff, Kirihara and 
Ueno [52, 192], for example. The 
Prefectural Institute of Industrial 
Psychology of Osaka was founded 
about 1925. By the middle of the 
thirties the Ministry of Welfare and 
the Government Railways employed 
a number of psychologists for voca- 
tional guidance and the promotion 
of efficiency of labor. There have 
been two journal publications in this 
field: The Science of Labor and Man- 
agement. During the war human re- 
sources were in short supply, and 
many books on the management of 
labor appeared. A considerable num- 
ber of psychologists entered the 
Army and the Navy and took part 
in researches on aptitude testing and 
personnel problems. 


Social Psychology 


Wundt's folk psychology and Mc- 
Dougall’s social psychology were 
studied at an early date by Japanese 
psychologists, but research in the 
social area did not develop for a long 
time. The measurement of attitude 
was first undertaken by Koga (86) 
about 1934, and problems of race 
differences (133) and group psychol 
ogy (46, 47, 97) interested psycho 
ogists from about 1931 to Worl 
War II. During this period, prob- 
lems of national morale were serious’ 
taken up by only a few psychologists, 
due, probably, to the fact that 1n 2 
Japan dominated by the military 


caste, such questions were consider 
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to be political or philosophical in 
nature rather than psychological. 


General Observations 


During the period 1926 to World 
War II, Japanese psychologists were 
self-conscious and self-critical. Their 
science was young, and they were 
passing through a period of self-ex- 
amination and, to some degree, of 
confusion. Matsumoto, president of 
the Japanese Psychological Associa- 
tion in 1933, discussed (91) what he 
considered to be the three weak 
points of Japanese psychology: (a) 
lack of communication among psy- 
chologists, resulting in a decreased ef- 
fectiveness of research; (b) excessive 
use of group methods involving 
questionnaires, tests, etc., and the 
paucity of analytic experimental re- 
searches; (c) excessive speed in adopt- 
ing new, and probably transient, 
western psychological concepts, with- 
out, in all cases, applying appropriate 
historical criticisms and evaluation 
of facts. 

The years near 1935 were the high 
Watermark of Japanese psychology 
before World War II. The number of 
psychological journals was at a peak 
and psychologists were very active. 
The Tohoku Psychologica Folia (Sen- 
dai), the Japanese Journal of Experi- 
mental Psychology (Kyoto), and the 
Acta Psychologica Keijo (Seoul) be- 
came important scientific journals. 
The Japanese Journal of Educational 
Psychology (Tokyo), the Japanese 
Journal of Applied Psychology (Hiro- 
pas, and Animal Psyche (Tokyo), 
faturing articles in a somewhat 
Popularized style, signaled the emer- 
gence and development of new and 
pefiortant areas. Two journals in 
ein related to psychology, Studies 
n the Science of Labor and Beiträge 
ES Psychoanalyse (Sendai) became 
influential, According to a survey by 

akamura and Nagasawa, about 100 
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Japanese works were abstracted in 
the Psychological Abstracts between 
1933 and 1940. 

The number of reports given at the 
fourth meeting of the Japanese Psy- 
chological Association at Sendai in 
1933 was 84; at the fifth meeting at 
Tokyo in 1935, 126; and at the sixth 
meeting at Keijo, Korea, in 1937, 63. 
At these meetings, strictly experi- 
mental work constituted about 40 
per cent of the papers. Sensation and 
perception were the major interests, 
about one in four papers being in 
these areas. 

Psychological research in Japan 
was severely reduced at the time of 
the outbreak of the Sino-Japanese 
conflict in 1937 and its later develop- 
ment into the Pacific war in 1941. 
Several journals could not be con- 
tinued, and at the end of the war even 
the Japanese Journal of Psychology 
could not be published regularly. 
During the long period of strife a 
number of psychologists worked in 
the Navy and the Army, but, in 
general, it may be said that psycho- 
logical research was almost com- 
pletely interrupted. The number of 
graduate students in psychology 
dwindled to essentially zero. 


From the End of World War II to the 
Present 


After World War II a great num- 
ber of changes took place in Japanese 
life, many of them attributable to 
the Occupation. Among other things, 
the changes include a new and liberal- 
ized Constitution and a proposed 
renovation of the Japanese educa- 
tional system. We cannot consider 
here all the factors behind the influx 
of students into universities and the 
influence exerted by the “new” edu- 
cation. It is sufficient to say that the 
number of graduate students in psy- 
chology increased considerably above 
the prewar level, and there was 
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heightened interest in psychology, 
particularly in areas that had not 
been well developed previously. At 
the same time, the Japanese Psycho- 
logical Association greatly increased 
its membership, and its meetings 
were held annually. During the 1951, 
1952, and 1953 meetings, the number 
of reports presented exceeded 300. 
The percentage of the reports that 
could be termed experimental was 
about 30. 

A further discussion of trends may 
here be aided by a subject-by-subject 
consideration of work in the various 
areas. 


General Psychology 


Yatabe wrote his Introductiom to 
Psychology (213) from the standpoint 
of general behavioristics (1950), and 
a Handbook of Psychology in 12 vol- 
umes is now being prepared under 
the sponsorship of the Japanese So- 
ciety of Applied Psychology. A latent 
behavioristic influence has now be- 
come fairly well established. In 
1941 Imada translated Bridgman's 
Logic of Modern Physics (33) into 
Japanese, and a criticism of opera- 
tionism was expressed in the same 
year by Yatabe (209). 


Experimental Psychology 


Handbooks of experimental psy- 
chology have been edited by Takagi 
and Kido, of which Volume I (gen- 
eral methods of experimentation), 
Volume II (vision), and Volume III 
(audition and other sensory experi- 
ence) have thus far been published 
(180). As to methodology, it is 
worth noting that interests in sto- 
chastics and factor analysis have in- 
creased among Japanese psycholo- 
gists since the war, and some re- 
searches are being conducted in these 
areas (Koga [57], Indo [34], Iwahara 
[39] and others). 

Perception. A book on the psy- 
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chology of perception was pub 
by Ogawa, Tanaka, and Osaka (14 
in 1952. Recently, Obonai (127) hi 
surveyed western and Japanese stud 
ies on visual perception. n 
Experiments on perception are, i 
general, maintaining ties with the- 
earlier gestalt influence, now som 
what mixed with behavioristic 
encies. Interest continues to 
shown in problems of space p 
tion generally (for example, 
Akishige [9], Miyakawa [100], O 
[148]), but problems of size and sh 
discrimination (as related to the pé 
ceptual constancies) seem to be pa 
ticularly popular (Kume [68], M 
kino [77], Misumi [95]). Quantitativ 
theories of space, as, for example 
Luneburg's, are received with co 
siderable appreciation. 1 
The topic of figural aftereffects i 
being studied with enthusiasm, 
in the opinion of one of us (CHG 
Japanese studies in this area af 
being done very effectively (e.g 
Ogasawara [130], Azuma [15], O 
[150], and others [30, 123]). Ka i 
is studying the effect of preced 
conditions on retinal rivalry. © 
periments on time errors have 
done by Inomata (35), Nakajima 
(115), Ono (145), and others. Obo- 
nai’s "induction theory” and Y 
kose's psychophysical research (207 
216) on form perception are influen; 
tial contributions. 
Interest in  psychophysiolog 
problems has increased since the 
of the war; this is due, among 0 
influences, to the very original wor 
of the physiologist Motokawa ( 
108, 109) on the electrical excitab 
of the eye. Studies on electrica 
citability are now being carried on 
some psychological laborato: 
They involve studies of the 
course of various color effects, f 
distortions, contrast effects, etc: —. 
Many areas of perception are Dem 
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examined, but it will not do at this 
time to describe the efforts in great 
detail. One trend may be worth men- 
tioning: the increased communica- 
tion of psychologists with physiol- 
ogists and people in related sciences. 

Audition has not become as ex- 
tensive a research interest as visual 
perception. Nevertheless, it is inter- 
esting to observe that Wada (201) 
has recently (1951) published a book 
with the same title as Yuki's earlier 
volume, A Psychology of Tone. A 
recent publication concerns an analy- 
sis of Japanese vowels (110). 

Other sensory systems have not 
been studied extensively since the 
war. 

Animal and human learning; think- 
ing. The greatly increased interest in 
animal experimentation is a major 
postwar development. Many experi- 
ments have been done recently at 
the universities of Tokyo, Kyoto, 
Tohoku, Hokkaido, and Osaka. The 
central problem has been the con- 
troversy between field theory and 
reinforcement theory. Studies have 
been made on the problems of sub- 
goal responses (Umeoka [194] and 
Yagi [204]), latent learning (Nozawa 
[124], Asami [38]), and the effects of 
amount of reinforcement and amount 
of drive. Studies of reasoning (Sue- 
naga [169]), place learning, and ex- 
perimental neurosis (Murata [114]) 
are in progress at this time. It is 
also worth noting that a Society for 
the Study of Behavior Theory has 
been established by psychologists at 
the universities of Kyoto, Osaka, and 
Osaka City. 

Considerable interest also exists in 
problems of lhuman learning. Recent 
studies have been concerned with 
retroactive and proactive inhibition 
(Sagara, Ishiwara [37, Umemoto 
[193], and researches are in progress 
9n questions of temporal factors in 
interpolated materials and of gener- 
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and interpolated lists. Human condi- 
tioning experiments (61, 62) have 
been almost wholly restricted to the 
laboratory of Kwansei Gakuin Uni- 
versity under Kotake. Studies of 
transposition responses in children 
have been performed, and the develop- 
mental aspects of this type of be- 
havior are now being examined by 
Sato, Okano (144), Motoyoshi (112), 
and others. 

Little work is being performed on 
motor skills. 

Recently a book on the psychology 
of learning was published by Yagi, 
Umeoka, and Maeda (205). 

In the period 1946 to 1949, Yatabe 
published A History of Thinking 
(211) and three volumes of his Psy- 
chology of Thinking (212). Volume I 
of the latter work is on concept and 
meaning, Volume II on relations and 
reasoning, and Volume Ill on the 
thinking of animals. These books by 
Yatabe, together with his History of 
the Psychology of Will (210), are 
authoritative and comprehensive. 
They are not as well known outside of 
Japan as they should be. 

Personality. R on personal- 
itv per se has not expanded after the 
war to the same degree as some other 
areas of interest. In this connection, 
however, it is important to observe 
that many investigators are at work 
on experiments that are closely re- 
lated to this area, for example, those 
that deal with the influence of mo- 
tives on discrimination (92, 98, 162). 
In fact, most Japanese experimental 
psychologists show considerable in- 
terest in this topic and experiments 
are being done that bridge the bound- 
ary between studies of personality 
and perception (45). Some psycholo- 
gists (e.g, Kitamura and Imada) 
have recently considered the problem 
of the ego. 


ti 
Recently (1951) one of us (Sato) 
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published A Psychology of Personal- 
ity (161) which discusses the train- 
ing procedures of Zen Buddhism 
against a background of western 
studies. 


Developmental Psychology, Education- 
al Psychology, and Measurement 


The reform of the Japanese educa- 
tional system, instituted by the Oc- 
cupation, involved an important pro- 
gram of teacher education and re- 
education. The re-education program 
has placed great stress on the data of 
educational psychology, and, under 
the new system, many school teach- 
ers have been instructed in this area. 
If widespread knowledge of an area 
is an important contributor to its 
vitality, then educational psychology 
(81, 212) is very much alive in Japan. 
American psychologists who went to 
Japan during the Occupation were 
usually educational psychologists, 
and it may be expected that the re- 
sults of their efforts will show tan- 
gible returns in the way of intensified 
activity in this area. 

Recently several books (22, 212) 
on the developmental aspects of edu- 
cational psychology have been trans- 
lated into Japanese; several books 
have been written on child psychol- 
ogy with an essentially educational 
emphasis, for example, one by Yama- 
shita (206) ; several books on the psy- 
chology of adolescence (Katsura [49], 
Ushijima [198]) have also appeared. 
Little attention has as yet been paid 
to problems of maturity and old age. 

Experimental studies of mental 
development are being carried on, 
notably by Sonohara (167) and 
Nakano (118). Takemasa published 
two volumes on Developmental Psy- 
chology (181) in 1948-1950. The 
large public interest in educational 
and developmental psychology is re- 
flected in the fact that journals in 


KOJI SATO AND C. H. GRAHAM 


this area are edited for the general 
public as well as the scientific. Em- 
phasis is placed on the problem of 
learning and its relation to vocational 
and educational guidance. 

Work continues in the general 
areas of intelligence and personality 
tests in several institutions, most 
notably in the National Research 
Institute of Education. Tanaka, for 
years one of the dominant figures in 
educational measurement, still re- 
mains a leader. 

An emphasis on mental hygiene 
characterizes postwar educational 
psychology, and some psychiatrists 
and psychologists are now working 
together. In Japan, the term ‘“‘men- 
tal hygiene" is used in its broadest 
sense to apply to normal as well as 
abnormal individuals. In student 
counseling work, many problems 
peculiar to Japan arise in a way that 
may have no parallel in the United 
States. In particular, political prob- 
lems are said to provide an "'appar- 
ent" focus of maladjustment in stu- 
dents. In any case the student coun- 
selor has an important role to play 
in modern Japan. 


Clinical Psychology 


The status of clinical psychology 1n 
Japan has changed greatly in the 
postwar period. The name “clinical 
psychology" is now in common use 
and the Society of Clinical Psychol- 
ogy has been formed. A journal of 
clinical psychology (Clinical Psy- 
chology and Educational Counseling) 
has recently been established, and 
now, for the first time, psychoan- 
alytic theories are becoming common 
subjects of discussion.  Projective 
tests, such as the TAT and Ror 
schach, play important roles in diag- 
nosis, and nondirective methods © 
therapy are being studied (55, 106, 
168). In all of this, the influence 9 
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American clinical psychology is felt 
strongly, but in addition, some truly 
Japanese methods, such as Morita's 
(104), which are based upon a syn- 
thesis of Zen doctrines and western 
studies, are regarded favorably. 
Without doubt, clinical psychology is 
developing rapidly as an important 
field of endeavor in Japan. 


Industrial Psychology 


Although industrial psychology in 
Japan has not developed to the same 
degree as in America, it has never- 
theless been shown that a basic core 
of subject matter, technique, and 
practice did exist prior to World War 
Il. During World War II problems 
of training and efficiency became 
serious, and a number of psychol- 
ogists worked in the Army and Navy, 
especially in the selection of aviators. 
Since the war, labor problems (train- 
ing and worker morale, especially) 
have received more attention than 
previously. Several prefectures have 
established research institutions for 
work in this area, and a few universi- 
ties have set up chairs of industrial 
psychology and vocational guidance. 
Work in selection shows little prog- 
ress, for in a land with 85 million 
people and disproportionately in- 
adequate natural resources, such 


Ere seem indeed to be fanci- 
ul. 


Social Psychology 


After the war many social prob- 
lems forced themselves upon the at- 
tention of psychologists. At the same 
time, the theories and experiments 
of social psychologists in America 
flowed into Japan and stimulated the 

apanese workers. In consequence, 
work in social psychology is now be- 
ing carried on at an accelerated pace. 
At the annual meeting of the Jap- 
anese Psychological Association in 
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1952, about 40 studies were reported 
by social psychologists and, if re- 
lated researches are added to this 
total, the number becomes consider- 
ably greater. 

In the seven years since the war, 
the topics of central interest have 
changed. At first, considerable ef- 
fort was spent on the analysis of 
social and cultural phenomena (3, 
32) that occurred during and after 
the war. Later, attention was paid 
to problems of group dynamics in con- 
nection with the democratization of 
the Japanese people. More recently, 
problems of social perception, com- 
munication (28), and the measure- 
ment of attitude became a central 
problem not only for social psychol- 
ogists but for many others as well. 
Most recently, psychologists have 
become interested in systematiza- 
tion (Ikeuchi [31], Suenaga [170], 
and others) Problems have been 
restated, and the possibility of treat- 
ing social behavior mathematically 
has been examined. Several re- 
search groups have been formed, 
for example, the Society for the 
Research in Behavior Mechanisms 
(at Tokyo), the Association for 
Group Dynamics (at Kyushu), and 
the Society for Youth Work (at 
Tokyo). As yet, no special journals 
have been established in social psy- 
chology, but a number of systematic 
treatises have appeared lately, no- 
tably those by Minami (94) and Shi- 
mizu (163). 


OVERVIEW 


Our survey points to the fact that 
psychology in Japan, like psychology 
in most countries, did not develop 
rapidly during the first 40 years of 
its existence. Circumstances im- 
proved after World War I, and after 
1926 psychology advanced in a surer 
fashion. By 1935 it gave promise, not 
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only of becoming stronger but also of 
becoming more diversified. Unfor- 
tunately, its favorable growth was 
severely inhibited by the events that 
led to the Sino-Japanese conflict and 
World War II. In effect, the war 
period became an interval of stagna- 
tion. Following the war, Japanese 
psychology underwent the readjust- 
ment common to all areas of Jap- 
anese life. At the present time, it 
shows signs of development to a new 
and stronger level. 


SOME OBSERVATIONS ON CURRENT 
ASPECTS OF JAPANESE PsvcHOLOGY 


It iscertain that the picture of pres- 
ent-day psychology in Japan differs 
in some details from the one that 
exists in America. The following dis- 
cussion is aimed at acquainting the 
American reader with some impor- 
tant facets of psychological endeavor 
in Japan. It is not intended that the 
discussion will deal with “problems.” 
Rather, it is hoped that the descrip- 
tion of selected areas of difference 
between Japan and America will 
acquaint the western reader with 
some facts that are essential to an 
understanding of psychology in a dif- 
ferent culture. 


Training 


Under the Occupation (1945-1952) 
a strong effort was made to remodel 
the educational system of Japan.? 
All aspects of education felt the im- 
pact of the attempt, and for a while 
the training of psychologists came 
under scrutiny. For purposes of this 
discussion we shall speak of the pre- 
Occupation system as the “old” sys- 
tem and the program suggested by 
Occupation authorities as the “new” 
system. 


*We are indebted to Dr. D. D. Smith, 
Office of Naval Research, Washington, D. C., 
for information on certain educational policies 
of the Occupation. 
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The old system had a number of 
the characteristics of the British 
system, Education leading to grad- 
uate work was based upon the 6-5-3-3 
system: 6 years in primary school, 
5 years in middle school, 3 in pre- 
paratory school, and 3 in the uni- 
versity. On leaving junior college 
(cf. p. 444) the prospective university 
student had to take entrance exam- 
inations which covered many fields; 
each university provided its own 
examination. If a student passed the 
written examination, he was called in 
for an oral examination and an inter- 
view. It is important to notice that 
university study began in the 14th 
school year as in England, not in the 
12th school year as in America. 

During his university days, a stu- 
dent might concentrate in psychology. 
The courses required in the psychol- 
ogy program varied somewhat from 
university to university, but, in gen- 
eral, they seem to have covered the 
conventional range of subject matter 
despite such differences as may have 
existed among, for example, Kyoto, 
Tokyo, and Keio. Kyoto had (and 
has) a requirement in philosophy 
that does not exist at Tokyo, and 
Keio emphasizes work in natural sci- 
ences. 

Occupation authorities made an 
attempt, under the new system, to 
change the method of selecting uni- 
versity students for national univer- 
sities. (This change constituted a 
small segment of a program to recon- 
stitute the educational system into à 
6-3-3-4 sequence, analogous to the 
American scheme: 6 years of primary 
school, 3 of lower secondary, 3 of up- 
per school, and 4 of university.) It 
was planned under the new system 
that university applicants should 
take the same examinations at ap- 
proximately the same time through- 
out the nation. After a student had 
"passed" the national aptitude test; 
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it was planned that he be invited to 
the university of his choice for spe- 
cial examinations prepared by that 
university. If the student passed the 
examinations, he was to be inter- 
viewed. If he did not pass, he was to 
be permitted to have his files sent 
to one more national university. Pri- 
vate universities planned to operate 
differently, but the projected pattern 
was similar in many respects to the 
one here outlined. It turns out now 
that ways of utilizing the national 
aptitude test differ among different 
universities. Some universities use 
these tests as screening devices, and 
some use them only to provide addi- 
tional advisory material. 

Under either the old or new system, 
a student who graduated from a uni- 
versity could enter upon graduate 
work. Under the old system, no spe- 
cial courses were required during the 
period of graduate training. Once the 
undergraduate courses were com- 
pleted the student could apply for 
admission to the dean of the depart- 
ment and to the Faculty of Litera- 
ture (under which the psychology de- 
partment usually exists). Usually, 
as at Tokyo, entrance examinations 
were required only for people who 
came from other universities. Under 
these circumstances it turned out 
that the graduate students in a 
given university had usually been 
undergraduates in the same uni- 
versity. Very often the student 
stayed on as an assistant to the pro- 
fessor for many years.* 

The new program did not say much 
about selection procedures, but it 
did make the proposal that 30 credits 
of course work be required in the first 
Year of graduate study. These cred- 


peo large Japanese universities are, in 

Vent not freely open to graduates of other uni- 

ne. Most scholars remain in the same 
ool from their undergraduate days until 
ey achieve professorial rank. 
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its were to be accumulated in the 
areas of experimental, clinical, social, 
developmental, educational, and cer- 
tain optional areas. So far, the new 
program has not received as much 
support as was expected. In 1953 
about one-third of entering graduate 
students embarked on the new course. 

The main argument against the 
new program is that it is expensive 
both for the university and for the 
student. Under the old plan, gradu- 
ate students often received salaries 
as assistants, and the number of as- 
sistants required in a university par- ^ 
tially determined the number of 
graduate students. Under the new 
plan, a student who has to devote his 
first year to study is not likely to re- 
ceive an assistant's fee.* 

Whatever may be its fate at other 
strata of the educational structure, 
the new system has a "hard row to 
hoe" at the level of graduate work. 
For this reason, it may here be more 
realistic to restrict the present dis- 
cussion of graduate training to the 
old system, unless otherwise specified. 

In general, few examinations are 
given during a man's graduate work, 
and the requirements for reports on 
research vary from university to 
university. At Tokyo and Kyoto a 
research report must be made once a 
year, but at some other universities 
this requirement does not exist, the 
final thesis being the criterion for 
completion of the work. As had been 
said, the major professor decides 


4 Where the new program does exist it has 
often been confusingly intermixed with the 
old program. The result is that few students 
take the graduate courses required by the 
new m to arrive at a possibly somewhat 
higher level than that specified by the M.A, 
degree in America, In the universities of 
Tokyo and Kyoto, for example, the number 
of students entering the graduate courses is 
only four a year. This number, of course, is 
small in comparison with the number of stu- 
dents who receive the master's degree every 
year in America. 
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when a man will finally stop his 
training. A doctorate degree is 
granted, if it is granted at all, usu- 
ally after many years of professional 
work. It is based upon a defense of 
the thesis. 

What opinions do Japanese psy- 
chologists have about ways of im- 
proving graduate training? The fol- 
lowing proposals were frequently en- 
countered and seem to represent the 
prevailing views of the Seminar partic- 
ipants: (a) establish a program lead- 
ing to a terminal doctor's degree ob- 
tainable after a reasonable period of 
graduate study;5 (b) differentiate de- 
partmental offerings into social, ap- 
plied, and experimental psychology; 
(c) appoint more professors in more 
diverse areas of psychology;* (d) im- 
prove the general level of undergrad- 
uate instruction; (e) specify formal 
prerequisites for graduate training in 
each area of psychology; (f) provide 
more adequately supervised work in 
the laboratory and in the clinic; (g) 
improve facilities for research and 
graduate training; (k) place psychol- 
ogy in some other faculty than the 
Faculty of Literature. 

Some of the proposals may be car- 
ried out. It is possible, for example, 
that psychology may vacate its 


* In Japan the M.D. degree is given after 
several years of research. "Bungakuhakase" 
(literally, Doctor of Literature, correspond- 
ing to the western Ph.D.) is not usually given 
until a psychologist approaches his fiftieth 
birthday. 

* Two professorships in psychology exist in 
each of the two most recently founded univer- 
sities, Hokkaido and Osaka. Tokyo Uni- 
versity has only recently obtained a second 
professorship. The universities of Kyoto, 
Tohoku, and Kyushu have only one chair. 
This situation is to be contrasted with the 
one that exists in philosophy. At Kyoto, for 
instance, there are seven chairs of philosophy. 
This ratio of philosophy professors to psychol- 
ogy professors has remained unchanged since 
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place in the Faculty of Literature, 
Other members of the Faculty even 
now accept the fact that psychology 
is misplaced. Certain government 
officials seem to favor the reclassifica- 
tion of psychology, but others do 
not, probably on the grounds that, as 
a recognized laboratory and clinical 
discipline, it will become more ex- 
pensive. 

While external agents are helping 
determine the position of psychology, 
psychologists themselves can do a 
great deal in the way of advancing 
training procedures. They can, for 
example, agree on well-chosen policies 
concerning the effective use of pres- 
ent facilities. (Facilities are, in fact, 
adequate for many purposes, and 
much excellent work can be produced 
with them. In a few universities [e.g., 
Kwansei Gakuin] the equipment used 
in prevailing research is good.) 
Above all, establishing policies aimed 
at producing broadly trained psy- 
chologists, who are skillful in the use 
of research techniques, will constitute 
an important step.* 


Professional Opportunities 


The lot of a psychologist in Japan 
has many desirable features, but, 
financially, it is no more (but prob- 
ably no less) attractive (in view of 
prevailing economic conditions) than 
that of many academic people in 
other lands. A full professor may 
earn $70 to $100 per month; an assist- 
ant professor, $50 to $90; an instruc- 


LU 

1 Laboratory equipment was, in general 
not obtainable during the war. In pagel 
quence much of the equipment, even 1n t d 
relatively new laboratories of Tohoku an 
Kyushu universities, is old. The provision o 
funds for laboratory equipment is not ye 
adequate. À 3 

* See, for general background information, 
an article by M. Imada: Recent psychologi j 
thinking in Japan. Ann. Proc. Dept. Psychol. 
Kwansei Gakuin Univer., 1954, 1, 1-9. 
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tor, $20 to $85; and an assistant about 
$20. (The salaries quoted include 
appropriate allowance for housing, 
etc.) The few jobs available in edu- 
cation, government, and industry 
probably provide slightly higher max- 
imum salaries than the academic 
positions. 

A main flaw in the professional 
picture lies in the fact that profes- 
sional psychology is relatively un- 
developed in Japan. Psychology has 
not been exploited by government, 
education, and industry, as has been 
the case in the United States, and 
few positions exist for psycholo- 
gists outside of universities. Until, 
through the years, a demand is felt 
for psychological work in other places 
than psychology departments, it is 
unlikely that professional opportuni- 
ties will expand greatly. It now re- 
mains for psychologists to exploit 
Circumstances in which the fruitful- 
ness of psychological methods may 
be shown to advantage. 


Problems of Communication 


Psychologists in Japan feel iso- 
lated, and, in fact, they have been 
isolated, not only from western psy- 
chologists but, to a greater degree 
than they desire, from each other. 
Eve isolation from western psy- 

Ologists is attributable to a great 
Many factors, among which geo- 
graphic position plays a minor role. 
E of contact with the western 
Er: at existed from 1937 to 1945 

he undesirable effects, and the 
En years (1945-1952) that 
odit constituted a period of read- 

"A ent that did not provide the 
Bi ances for scientific give- 
Tom e. (During the Occupation a 
milito r of American educational and 

sa Ty psychologists worked with 

pation officials, and, in fact, did 


Stablish good contacts with Japa- 
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nese workers. However, these pleasant 
relations remained indirect in most 
cases, because the Americans usually 
worked in administrative positions. 
It is to be hoped that more oppor- 
tunities for close personal relations 
in the seminar and laboratory, such 
as existed during the Kyoto Seminar 
in Experimental Psychology, will be 
established soon. It is encouraging to 
observe that the appointment of 
George M. Haslerud as Fulbright 
Professor at Kyoto and the visit of 
Koiti Motokawa to the United States 
during 1953-1954 signify steps in this 
direction.) 

The barrier of language has, as 
much as any other factor, served to 
confirm the isolation of the Japanese. 
Despite the fact that Japanese train- 
ing in foreign languages is good, the 
training of western workers in Jap- 
anese is bad. It has been the lot of 
Japanese psychologists since their 
beginnings to do research which re- 
mains little known to the outside 
world. In consequence, Japanese psy- 
chologists have had to depend on the 
approval of their fellow countrymen. 
This situation may certainly have 
been attended by undesirable effects, 
which, fortunately, now seem to be 
disappearing. 

Certain devices for increasing com- 
munication might be tentatively 
recommended without commitment 
as to their ultimate value. For ex- 
ample, Japanese psychologists could 
be encouraged to publish in other 
languages than their own. (It is 
worth observing in this connection 
that, even now, Japanese psychol- 
ogists, feeling a need to converse 
more effectively with their western 
colleagues, usually add extensive 
English abstracts to their research 
reports.) Other devices such as inter- 
cultural seminars, exchange scholar- 
ships, exchange professorships, inter- 


458 


cultural institutions, etc. might pro- 
duce more intimate scientific rela- 
tions. Certainly such programs 
should be encouraged. 

During the war and for some time 
thereafter, Japanese psychologists 
had little contact with each other. 
The Japanese Psychological Associa- 
tion may well ask itselí what meth- 
ods may be used to increase com- 
munication among its members. The 
planning of symposia on topics of 
general interest might be suggested 
as an area in which the Association 
can make effective contributions, and 
there are indications that action will 
be taken along this line. Finally, it 
is worth observing that devices that 
improve intercultural relations can 
also improve contacts among the 
Japanese. One of the chief merits of 
the Kyoto Seminar lay in the fact 
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The purpose of this review is to 
describe the leaderless group discus- 
sion (LGD): its history, applicability, 
method of administration, and relia- 
bility and validity as a technique for 
assessing leadership potential. We 
shall discuss the major variants in 
procedure that have been tried, and 
indicate the effects of these variations 
on LGD reliability and validity. We 
shall not consider the initially leader- 
less discussion’s role as a basic re- 
search tool for studying the develop- 
ment of leadership as it has been 
employed by investigators such as 
Carter (29, 30, 31, 32), Pepinsky, 
Siegel, and Vanatta (61), and Bell 
and French (24), although we will 
make use of their findings wherever 
they shed light on the LGD as an 
assessment device. 


History OF THE LEADERLESS 
Group DISCUSSION 


According to Ansbacher (2), the 
originator of the method was J. B. 
Rieffert, who directed German mili- 
tary psychology from 1920 to 1931. 
he technique, called by first users 
the Schlusskolloquium or Rundge- 
etch, was aimed at showing be- 
àvior "toward equal partners," The 
‘rman Army used the procedure 
E about 1939, while the Navy 
ipntinued employing it in their selec- 
‘on programs until late in World 


1 

es! parts of this article were sum- 
beats for presentation at a Symposium on 
B M Performance Tests, Sixty-first 
tal ^w Meeting of the American Psychologi- 
1 ona Cleveland, Ohio, September 
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War II. Various German civilian 
agencies now appear to be employing 
the LGD as an assessment tool. 

Influenced by the German de- 
velopments in situational testing, by 
1942 the British War Selection Board 
introduced such tests into their bat- 
tery for selecting Army officer candi- 
dates. The basic series of leaderless 
group tests was evolved by Bion and 
included the LGD (41, 45, 70). A 
similar program was established by 
the British Navy (70). 

At the end of the war, the LGD 
was employed by Fraser (39, 40) as a 
device for screening British manage- 
ment trainees, and by Vernon (68, 
69) for testing British Civil Service 
applicants. 

Similar developments took place in 
Australia (43, 65), South Africa (3), 
and Norway.’ 

The OSS Assessment Staff (59) ap- 
pears to have initiated use of the 
LGD in the United States late in 
World War II. American federal and 
state civil service examiners began 
trying out the technique at the end of 
the war (5, 26, 27, 53). 

Approximately 25 per cent of the 
190 Civil Service agencies surveyed 
by Fields (37) reported using the 
LGD. A Federal Civil Service man- 
ual appeared in 1952 (56). The LGD 
has also been employed by several 
American industrial and business 
firms. Its rapid acceptance has led 
both Meyer (58) and Douglas (35) to 
caution about overestimating its 
validity and utility. 

2 Private correspondence from V. C. Jahl, 
Chief Psychologist, Norwegian Armed Forces. 
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We attribute the use of the LGD 
primarily to its relative ease of ad- 
ministration compared to individual 
interviews, especially where large 
numbers of applicants are involved, 
as well as to its face validity com- 
pared to paper-and-pencil tests. 

It may be that the face validity of 
the LGD really is what Gulliksen 
(44) has labeled intrinsic validity. A 
number of exírinsic validity studies 
are now available which may justify 
ex post facto the possibly widespread 
premature acceptance of the LGD as 
a valid measure of the tendency to 
display successful leadership. 

Several investigators have assumed 
that the LGD is intrinsically valid 
as a means of appraising successful 
leadership behavior. Pepinsky, 
Siegel, and Vanatta (61) have used 
LGD performance as a criterion for 
evaluating the effects of counseling; 
Bell and French (24) and Carter and 
his associates (29, 30, 31, 32) have 
assumed that they are studying lead- 
er behavior when observing discus- 
sions. 

Another reason for the continuing 
interest in employing situational tests 
to forecast or appraise leadership 
behavior may be due to the nature of 
leadership and psychometrics. 

In reaction to the earlier emphasis 
on the effects of individual differences 
on leadership behavior, there has 
been, more recently, an emphasis on 
situational effects on leader behavior. 
That both sources of variance are 
important for leadership behavior 
can be accounted for fully only after 
analyzing the main effects and the 
interaction effects of situational dif- 
ferences and individual differences 
in motivation, behavioral history, 
and biological level (maturity, hered- 
ity, and integrity of the CNS). 

* This formulation was origi s 
by W. P. Hurder. MARY aed 
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It follows that any test designed to 
forecast leadership potential and in- 
tended to have some generality of 
application would need to: (a) share 
elements in common with a variety 
of situations; (b) vary directly with 
the situation for which predictions are 
to be made. 

Most psychometric test procedures 
have been developed to meet only the 
first requirement, for many behaviors 
are relatively little influenced by 
situational changes. Thus, one's 
speed-of-arm movement or spatial 
visualization accuracy remains fairly 
constant over a wide range of situa- 
tions. On the other hand, since the 
effects on leadership behavior of situ- 
ational change are large, psycho- 
metric methods like the LGD, and 
other situational tests which meet 
both requirements, would appear to 
offer more promise initially as psy- 
chometric techniques for forecasting 
leadership behavior. ; 

With reference to the first require- 
ment, the LGD, in common with a 
range of other situations in which 
leader behavior is to be appraised or 
predicted, appears to share such ele- 
ments as the need for would-be 
leaders: to communicate effectively, 
to overcome inertia, to solve various 
interaction problems, to meet dead- 
lines, and to reach consensus. 

With reference to the second re- 
quirement, the LGD and other situ- 
ational tests, by their very construc: 
tion and administration, tend to S 
consistently with the nature of the 
examinees and the real-life situations 
for which personnel are being chosen. 
Candidates for positions of leader: 
ship are assessed among administra- 
tors by observing them solving & : 
stracts of administrative problem 
among administrative trainees. E 
of the same individuals for position 
as Army officers would involve 
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studying them solving military prob- 
lems during interaction with officer 

| candidates. To some extent, there- 
fore, situational tests may be able 
to take situational variations into 
account. 


APPLICABILITY OF THE LEADER- 
LESS GROUP DISCUSSION 


According to published reports, 
the LGD has been used to assess can- 
didates for many professions and 
occupations. The examinees have 
included: officer candidates (2, 41, 
43, 45, 70, 71); OSS agent applicants 
(59); advanced Naval (31, 32), Air 
Force, and Army ROTC cadets (15); 
industrial management trainees (39, 
40, 65); industrial executives and su- 
pervisors (21, 22); shipyard foremen 
(53); supervisory labor mediators 
(42); civil service supervisors and 
administrators (3, 54, 68, 69); ap- 

| plicants for foreign service (68, 69); 
graduate engineer trainee applicants 
(9, 10); sales trainee applicants (9, 
10); public health physicians (5, 26, 
27); teachers (47); visiting teachers 
(51); and social 
| 


service workers 
(52). 


DESCRIPTION OF THE LEADERLESS 
Group DISCUSSION 


The basic scheme of the LGD is 
to ask a group of examinees, as a 
group, to carry on a discussion for a 
given period of time. No one is ap- 
Pointed leader, The examiners do not 
Participate in the discussion, but 
Temain free to observe and rate the 
Performance of each examinee. To 
date, there has been little standard- 
lation by all examiners of the num- 
E of discussants per group, the 
k gth of testing time, type of prob- 

ms, if any, presented to the candi- 
ates, and the directions given to 
3 em. Also, the number of raters has 
atied, as has the seating arrange- 
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ment. Examiners have differed in 
the kinds of behavior they have ob- 
served and rated and in the extent 
to which their ratings have been at- 
tempts to describe the behavior they 
have observed, rather then attempts 
to make inferences about the per- 
sonality of the candidates. 


Factors Rated by Observers 


Unless otherwise indicated, the 
LGD results we shall describe are 
either based on observers' ratings of 
the amount of successful leadership 
displayed! in the discussion, or are 
inferences about the personalities of 
the candidates based on observations 
of this behavior. 

In regard to personality inferences, 
Couch and Carter (34), following a 
factorial analysis of observers' infer- 
ences based on several kinds of situ- 
ational tests including the LGD, 
found that three independent factors 
could be isolated which accounted 
for situational test ratings. The fac- 
tors and the ratings with highest 
loadings on each factor were: (a) 
Individual Prominence (authoritari- 
anism, confidence, aggressiveness, 
leadership, striving for recognition); 
(b) Group Goal Facilitation (efficien- 
cy, cooperation, adaptability, pointed 
toward group solution); (c) Group 


“A leadership act is said to occur when 
member A of a group behaves in a way di- 
rected toward changing another member, 
B's, behavior. More specifically, a leadership 
act occurs when A's behavior is directed to- 
ward: (a) changing the intensity and/or di- 
rection of B's motivation, and/or (b) re- 
structuring B's abilities to cope with the 
situation and reduce B's needs (13). 

‘All attempted leadership acts in which A 
reaches his goal of changing B are consi: 
successful leadership. If B's change in be- 
havior brought about by A's successful leader- 
ship leads to need satisfaction for B and for A 
(apart from A's satisfaction in being a success- 
ful leader), A's successful leadership act is 
considered effective (46). 
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Sociability (sociability, adaptability, 
pointed toward group acceptance). 

As will be pointed out later, many 
of these specific inferences by obser- 
vers, such as those concerning au- 
thoritarian tendencies, may be valid 
only as descriptions of performance in 
leaderless groups and actually may 
be inversely related to attitudes and 
performance in real life.5 

A similar factorial analysis of OSS 
situational test data by Sakoda (62) 
uncovered three factors which re- 
semble Couch and Carter's: (a) 
Physical Energy (energy and initia- 
tive, physical ability, leadership); (b) 
Intelligence (effective intelligence, 
observing and reporting, propaganda 
skills); (c) Social Adjustment (social 
relations, emotional stability, ‘‘secu- 
rity"). Carter (29) noted that similar 
factors appeared in several studies 
when real-life leader behavior was 
rated and factor analyzed. 

In aiming to estimate successful 
leadership behavior, the descriptive 
check lists of behavior in leaderless 
discussions that evolved from vari- 
ous studies made by the author and 
associates appear to concern primar- 
ily what Hemphill (46) has labeled 
"Initiation of Structure," a factor 
which has emerged in several of the 
Ohio State Leadership Studies (e.g., 
38) and which has similarities to 
Couch and Carter's "Group Goal Fa- 
cilitation." 

In the check list used in most stud- 
ies by the author and his co-workers 
to assess leader behavior, raters are 
asked to indicate whether each candi- 
date showed the following behaviors 
“a great deal" (4 pts.), "fairly much" 
(3 pts.), "to some degree" (2 pts.), 
"comparatively little” (1 pt), or 
"not at all" (0 pts.): (a) Showed initi- 

š For a discussion of leadership and au- 


thoritarianism, the reader is referred to Hol- 
lander (48). 
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ative; (b) was effective in saying what 
he wanted to say; (c) clearly defined or 
outlined the problems; (d) motivated 
others to participate; (e) influenced the 
other participants; (f) offered good 
solutions to the problem; (g) led the 
discussion. The sumof ratings received 
on all seven items is the examinee's 
performance score. 

In an unpublished study by Pruitt 
and the author, seven items were 
rated which aimed at assessing the 
extent to which each LGD partici- 
pant showed “consideration for the 
welfare of his associates"——a factor 
of leader behavior uncovered in vari- 
ous Ohio State Leadership Studies 
(38). This factor is similar to Couch 
and Carter's Group Sociability and 
Sakoda's Social Adjustment factors. 
The seven items included: (a) En- 
gaged in friendly jokes and com- 
ments; (b) made others feel at ease; 
(c) complimented others; (d) helped 
others; (e) encouraged others to ex- 
press their ideas and opinions; (f) had 
others share in making decisions with 
him; (g) helped settle conflicts. At the 
same time, a list of seven "Initiation 
behaviors were observed and rated. 

The intercorrelation between 80 
LGD participants’ Initiation an 
Consideration scores was .78, too 
high to warrant continued use of 
both assessments. More significantly, 
the mean rating on any single Initia- 
tion behavior was 2.2 points, while 
the mean rating on any single Con- 
sideration item was 0.75 points. 
Thus, although raters could asses 
reliably both types of leader behavior 
(r,,=.90, .85), much less Considera- 
tion behavior appeared, and most 0 
it was exhibited by Initiators.. 

On the basis of this, and evi 
to be presented later, we suspect thà 
the LGD rating is more an assess 
ment of the tendency to initiate struc 
ture in an initially unstructur 


social situation—one of several types 
of successful leadership behavior— 
than an assessment of tendencies to 
be considerate. 


Conditions Affecting LGD Leadership 
Ratings and. Performance 


A number of variations have been 
systematically studied which may or 
may not seriously affect ratings of 
successful leadership in the LGD. 
These include: the size of the group, 
the seating arrangement, pretest 
coaching, the motivation of the par- 
ticipants specific to the situation, and 

member participation. 
Effects of size. As the size of the 
group increases from two to twelve, 
the mean LGD rating assigned is 
reduced approximately 50 per cent. 
Eighty-three per cent of the variance 
in ratings where members come from 
groups of 2, 4, 6, 8, and 12 is ac- 
counted for by size according to a 
Study of 120 examinees by the au- 
thor and Norton (19). It appears 
that the opportunity to display suc- 
cessful leadership is closely associated 
with the size of the group. We con- 
clude that proper correction must be 
made in any LGD studies where dif- 
erent examinees have been tested in 

Soups varying in size. 
Effects of location of seat and seating 
arrangement, Sixty-eight discussions 
among 467 participants were ana- 
Yzed by the author and Klubeck 
(16) to determine the effects of the 
Breuer seat a participant held on 
B co rating he obtained. For 
V-shaped arrangements and 
ER in which members sat in paral- 
Em facing each other, members 
s at the ends obtain slightly 
Sher mean scores. In two sets of 
E. Seven studies included in this 
" n the results were significant 
> e1 per cent level. The effects 
ed to disappear when variations 
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in the real-life esteem of the members 
were held constant. At any rate, the 
differences, statistically significant or 
not, associated with participants’ 
location in the group were too small 
to be of much practical concern. 

Pretest coaching. If LGD perfor- 
mance can be greatly altered by 
means of brief coaching which does 
not reflect any real change in per- 
sonality, its routine use by large 
organizations such as the Armed 
Forces for screening OCS applicants 
would be impossible. Therefore, 
Klubeck and the author (50) briefly 
coached the third highest and the 
sixth highest participants (among 
seven) of each of 20 leaderless dis- 
cussions and then ran retests on each 
group of seven. An analysis of co- 
variance led to the inference that 
while those who were fairly high in 
LGD score initially profited signif- 
icantly from coaching, those who 
were initially low did not profit at all 
from the brief coaching. While the 
shift upward of the high ranking 
subjects was statistically significant, 
it was not very large in an absolute 
sense. 

The investigators cited a number 
of reasons for rejecting the inference 
that the differential improvement was 
due to differential motivation, and 
concluded that LGD behavior is a 
function of personality traits and 
needs which cannot be altered readily 
by brief coaching. l 

This interpretation was in agree- 
ment with Harris’ (45) opinion that 
one could not “cram” for the leader- 
less group discussion. Harris sug- 
gested that priming a candidate, 
rather than help would most likely 
handicap him by "inhibiting his 
spontaneity.” 

In an unpublished study, Pruitt 
and the author gave the same part- 
directive, part-permissive coaching 


470 


as Klubeck and the author had ad- 
ministered previously, but the groups 
receiving training were coached for 
15 minutes while assembled as groups 
prior to undergoing the LGD.  Di- 
rectly in line with Harris' hypothesis, 
the five trained groups, each with six 
participants, showed significantly 
less “initiation” behavior than eight 
untrained control groups of six each, 
The trained groups exhibited only 
half as much “consideration” be- 
havior as the untrained groups. Rat- 
ers commented on the "freezing up" 
and the “increased nervousness and 
tensions" which characterized the 
trained groups; this was in line with 
Harris’ hypothesis. 

As pointed out by Klubeck and the 

author: 
- -^. long term training is obviously an entirely 
different matter. Thus, if an ineffective indi- 
vidual underwent psychotherapy successfully 
which led to favorable modifications in his 
needs and self-esteem, there is no reason to re- 
ject the possibility that he would exhibit im- 
proved performance on the LGD, but here the 
LGD would reflect real personality change 
(50, p. 71). 

Actually, Pepinsky, Siegel, and 
Vanatta (61) carried out such long- 
term training with some success. 

Effects of exirinsic motivation of 
participants specific to the situation. 
Do momentary changes in the in- 
centive to participate, unrelated to 
the personality of the participants, 
make much difference in LGD per- 
formance? An unpublished study 
completed by the author tentatively 
suggests that added extrinsic motiva- 
tion is relatively unimportant in 
determining the behavior of LGD 
participants. Two small samples of a 
total of 31 college students were 
tested and retested in leaderless 
groups of seven to nine. The first 
sample was told that the first test 
was mere practice and had no bearing 
on class grades, but that the second 


BERNARD M. BASS 


test would count as an im 
grade determiner. The sec 
ple took the "important" 
tion first and was then given’ 
ond test as a “check-up on th 
that would not affect 
means of successful leade: 
played in both the motivate 
motivated situations were pra 
identical. Moreover, the corr 
between LGD performance ii 
situations was .86, indicati! 
the addition of incentives al 
performance relatively and ab 
ly very little. 

Needs more basic to the pe 
ity probably energize and st 
LGD behavior. These needs a 
to be either openly denied oi 
scious to a great extent. i 
ROTC cadets, in an unpubli 
study by the author and € 
were asked to indicate on a five: 
scale how much they tried to d 
well as possible on an LGD, ac 
ation of only .30 was found 
reported effort and actual. 
scores obtained. E 

Analogous to performance - 
paper-and-pencil aptitude tests; 
creasing the extrinsic moti 
mature subjects does not serve 
raise scores materially, since | 
jects usually perform near 1 


centive. Of course, we mig 
ceed in lowering performance 
could sufficiently discourage si 
or increase tension beyond so 


be no more sensitive to va ia ; 
examinee’s extrinsic motivatio, 
most aptitude tests. It is 
that the LGD is less affected b 
extrinsic motivation às 
to obtain a job than is the’ 
disguised personality test. 
What is needed are com 
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studies of operational, as opposed to 
experimental, validities of the LGD, 

Amount of participation. Analyses 
generally find a high correlation be- 
tween the sheer amount of talk of 
LGD participants and the scores they 
earn for successful leadership. Time 
spent talking correlated .65 with rat- 
ings of success for 64 sales and man- 
agement trainee candidates (10), .77 
for 140 sorority girls (23), .93 for 
20 college students (6), and .96 for 
36 college students of an unpublished 
study by R. L. French.* 

This high correlation is disturbing 
at first, since it suggests that LGD 
ratings primarily discriminate the 
verbose from the terse. However, 
the relationship can be shown by a 
series of deductions to logically fol- 
low if we assume that almost all 
participation in the LGD is attempted 
leadership behavior, that LGD rat- 
ings are assessments of successful 
leadership behavior, and that at- 
tempted leadership must occur in 
order for some of it to be judged suc- 
cessful. More detailed discussions 
of these relationships are presented 
elsewhere (6, 13). 

„Kind of participation. Qualitative 
differences appear in the kinds of 
LGD participation engaged in by the 
Successful leader and by those who 
Participate and attempt leadership 
acts, but who nevertheless earn low 
Scores as successful leaders. When 
the responses during leaderless dis- 
cussions of 46 fraternity members 
Were analyzed by the Bales technique 
| (4), other judges' ratings of partici- 
| p» successful leadership corre- 
ated .66 with frequency of attempting 
answers; .50 with frequency of posi- 
We socioemotional responses; .44 


: FRENcn, R, L. Verbal output and leader- 
ad Status in initially leaderless discussion 
ind) Amer. Psychologist, 1950, 5, 310. (Ab- 
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with frequency of asking questions; 
and .32 with frequency of negative 
socioemotional responses (12). The 
roles associated with successful lead- 
ership were  initiator-contributor, 
opinion-giver, elaborator, compro- 
miser, orienter, evaluator, energizer, 
encourager, etc. (11). Only one type 
of participation, attempting answers, 
was associated (r =.48) with the real- 
life esteem of the participants, 

In a study of 40 ROTC cadets, 
Carter, Haythorn, Shriver, and Lan- 
zetta (32) found that LGD leaders 
were much more likely than non- 
leaders to diagnose the situation, ask 
for expressions of opinion, propose 
courses of action for others, support 
and defend their proposals, give in- 
formation, express opinions, and ar- 
gue with others. 


RELIABILITY OF THE LEADERLESS 
Group DISCUSSION 


The reliability of the LGD will be 
considered in two phases: rater agree- 
ment and test-retest reliability. 
Wherever possible we will indicate 
factors that systematically influence 
these reliability estimates. 


Rater Agreement 


Table 1 displays the average agree- 
ment found between any two ob- 
servers in rating the first LGD ad- 
ministered to the designated sub- 
jects. The results suggest that the 
refined 7-item check list and its 
predecessors of 9 and 14 items used 
in most of the studies by the author 
and associates (e.g., 15) yield a con- 
sistent rater correlation of between 
.82 and .84. LGD scores based on 
two raters using the check list method 
yield a satisfactory estimated relia- 
bility of .90 or above. 

A number of factors influence the 
agreement between raters. These 
will be considered next. 
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TABLE 1 
REttiaBILITY or LGD RATINGS ESTIMATED BY THE AVERAGE AGREEMENT | 
BETWEEN Any Two RATERS 


Average Reli 
" Correlation : 
Subjects What Rated Hating | between p: 
Any Two Using Tee d 
Observers ng 
| 
sales and mgt. trainee | Desirability for | Paired com- .67 .80 
candidates (9) position parison 
2. 219 administrative trainee | Personality and | Graphic rat- .70 .82 
candidates (3) ability (high- ing scale 
ly intercorre- 
lated) 
3. 36 male college students | Leadership per- | Graphic rat- .61 75 
(30) formance ing scale 
4. 40 NROTC subjects (31) Leadership, au- | Graphic rat- .53 .70 
thoritarian- ing scale 
ism, initia- 
tive, and in- 
sight (highly 
intercorre- 
lated) 
5. 20 mixed college students | Successful lead- | Check list .67 .80 
(6) ership dis- (13-item) 
played 
6. 46 fraternity members (20) | Successful lead- | Check list .82 .90 
ership dis- (14-item) 
played 
7. 120 mixed college students | Successful lead- | Check list .82 .90 
(19) ership dis- (9-item) 
played 
8. 244 ROTC cadets (15) Successful lead- | Check list .83 9i 
ership dis- (9-item) 
played 
9. 88 fraternity pledges (72) | Successful lead- | Check list .84 .91 
ership dis- (9-item) 
played 
10. 140 sorority members (23) | Successful lead- | Check list .83 .91 
ership dis- (7-item) 
played 
11. 48 mixed college students* | Initiation of Check list .90 95 
structure and (8-item) 
interaction 2 
12. 48 mixed college students* | Consideration of | Check list .85 9 
others (8-item) 


* Unpublished data. 


Effects of discussion effectiveness. A 
pronounced curvilinear relationship 
was found between the rated effec- 
tiveness of 67 leaderless discussions 
and the correlation between two ob- 
servers’ ratings for each discussion. 


Rater agreement was highest for 
discussions of average effectiveness 
and was lowest for discussions whic 
were either extremely effective 0T 
extremely ineffective. The average 
eta found for four subsamples of these 
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61 discussions was .68 (17). It may 
be that observers become too inter- 
ested in the very effective discussions 
and too bored or detached from the 
very ineffective ones; or, the variance 
of LGD ratings may be reduced in 
discussions at either extreme of the 
distribution of effectiveness, which, 
in turn, will serve to reduce the relia- 
bility of the ratings. 

Effects of size. The number of 
participants in a given discussion ap- 
pears to influence the extent to which 
observers will agree with each other. 
The author and Norton (19) tested 
five samples of 24 subjects each in 
discussions of two, four, six, eight, 
and twelve in size. Maximum agree- 
ment among observers was reached 
in groups of six (r=.89) and was 
lowest in groups of two (r=.72). 
Carter, Haythorn, Meirowitz, and 
Lanzetta (31) found that mean ob- 
server agreement was only .70 for 
four-man groups, while it was .85 
When the same men were retested in 
tight-man groups. 

Effects of test or retest. The last 
Tesults cited can also be accounted 
for, in part, by the fact that observer 
agreement appears to increase when 
the same subjects are retested. Thus, 
When the 120 subjects of the author 
and Norton were all retested, ob- 
Server agreement increased for groups 
of all sizes from a mean correlation 
of .82 to a mean correlation of .90. 

Effects of the object of rating. The 

ata summarized in Table 1 sug- 
st that rater agreement is higher 
Where raters employ a check list in 
which they merely indicate the extent 
to which each of a number of items 
of leader behavior was exhibited by 
‘ach candidate, rather than where 
E €y employ a single graphic rating, 
t where they attempt to make in- 
"ences about the standing of the 
‘xaminees on intervening variables 
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of personality or ability supposedly 
underlying the LGD behavior being 
observed. The median estimated 
reliability of check list ratings is .90; 
the median estimated reliability of 
other types of ratings is .80. 
Test-Retest Reliability 

The seven available test-retest 
reliability coefficients and the studies 
on which they are based are listed 
in Table 2 in order of the size of the 
coefficients. Listing them by size 
leads to the inference that test-retest 
reliability is higher the more similar 
the test and retest situations, The 
consistency of LGD behavior is high- 
er the less group membership changes 
from test to retest; the less the prob- 
lem changes; the less some members 
are increased in ability to lead; the 
less some members are increased in 
“real-life” status; the less observers 
are changed; and the less time be- 
tween tests increases, permitting 
more random or biasing change to 
occur among participants. These 
results conform to the principle of 
consistency, proposed by the OSS 
Assessment Staff, that a subject will 
respond to similar environmental con- 
ditions in a similar manner (59). 

Where changes in situation from 
test to retest are reduced to a mini- 
mum, a high test-retest reliability 
is found. It is probable that where 
behavior check lists describing par- 
ticipant behavior are used, the true 
test-retest reliability of the LGD is 
somewhere between .75 and .90. 

Effects of size. The number of 
participants in a group appears to 
determine the consistency of the be- 
havior from test to retest. For, while 
the author and Norton (19) found 
groups from four to twelve in size to 
have an average test-retest reliabil- 
ity of above .90, groups of two had 
a corresponding reliability of only 
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TABLE 2 


TEST-RETEST RELIABILITY OF RATINGS OF LEADERSHIP DISPLAYED IN LEADERLESS 
Group Discussions RANKED IN ORDER OF SIMILARITY OF TEST AND RETEST 


a E aa 


Differences between | Interval EG 
Subjects Rating Method Test between | C la- 
Situations Tests ti 
ions 
1, 120 mixed college stu- Check list None Week 90 
dents (19) (9-item) 
2. 31 mixed college stu- Check list One “important | Week 86 
dents* (9-item) exam,” other 
not 
3. 140 sorority members Check list Two members of | Three 15 
(23) (7-item) each group of 7 hours 
given training 
between tests 
4. 25 mixed college stu- Participants | Groupsrearranged, | cek .75 
dents (24) rank each six retests 
other 
5. 20 mixed college stu- Checklist Intervening LGD | Six .12 
dents (6) (13-item) among leader or weeks 
followers only 
before second 
test 
6. 23 mixed college stu- Check list Group rearranged, | Two days 58 
dents (21) (7-item) type of problem to week 
systematically 
varied: case his- 
tory, no problem 
presented, and 
leader specifica- 
tion to be out- 
lined 
7. 172 ROTC cadets (15) Check list Groups rearrang- | Year .53 
(9-item) ed; some sub- x 
8. 36 male college stu- Rating scale 


dents (30) 


jects changed in 
"real-life" status 
more than others 
between tests; 
different raters 

Six intervening 
situational tests 
in groups of two 
before second 

` test 


Three to .39 
four 
months 


RM E aculeo S ce lPaua 0. 1| 5 2 al RENS 


* Unpublished data. 


46. These results suggest that two- 
man and probably three-man leader- 
less group discussions should not be 
used for assessment purposes. 


VALIDITY OF THE LEADERLESS 
GRnouP Discussion 


The construct validity of the LGD 


will be examined in this discussion. 
This requires both logical and empiri- 
cal review. r 
Figure 1 diagrams the relation- 
ships among a group of variables 0 
importance in the study of leader- 
ship. Using a set of postulates ba! 


primarily on learning theory, the 
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NS OF ABILITY TO 
GROUP'S PROBLEMS 


CURECTION OF DETERMINATION... 
INTERVENINO. VARIABLE nd 
OBSERVABLE OR DIRECTLY MEASURABLE VARIAGLEC—> 


Fic. 1. RELATIONS AMONG LEADERSHIP 
VARIABLES 


author elsewhere (13) has deduced 
these relationships, which may be 
summarized as follows: 


1. The more a member js able to solve the 
group's problems because of personal charac- 
teristics (such as his capacity, achievement, 
responsibility, and participation), the more 
likely he is to exhibit successful leadership be- 
havior in real life, in quasi-real situations, and 
in the leaderless group discussion. These per- 
sonal characteristics are reflected in perform- 
ance on various psychological tests of intelli- 
gence, proficiency, and personality. 
2. The more a member exhibits successful 
kadership, the higher is his esteem among his 
associates—the extent to which he is regarded 
of worth as a member or leader to the group, 
Tegardless of his position—and the higher will 
the merit ratings he receives as a successful 
der or member. The higher his esteem, the 
More likely he is to be of further success as a 
leader among his associates. 
E. The higher a member's status, as inferred 
s his rank or the worth of his position 
ong his associates, the more likely he is to 
Successfully lead his associates. 


Further relations between vari- 
Be noted in Fig. 1 can be ignored 
a ratings of LGD performance 
E actually valid measures of tend- 
Be of individuals to differ in 
A cessful behavior, and if we accept 
% a logical rule that variables with 
E determinants should cor- 
€ positively with each other, 
en following the outline of Fig. 1, 
Scores should correlate: 
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1. With real-life rank (and hence real-life 
status) when the LGD is among associates of 
different ranks; 

2. With real-life merit ratings (hence real- 
life esteem); 

3. With leadership performance in other 
quasi-real situations; 

4. With observations and indices of success- 
ful leadership performance in real-life situa- 
tions; 

5. With personal characteristics as meas- 
ured by psychological tests and measurements 
commonly associated with success as a leader, 


Any positive correlation between 
an LGD rating and these other speci- 
fied measures should provide partial 
evidence of validity of the LGD per- 
formance rating—namely that it 
actually measures leadership poten- 
tential or individual differences in 
tendency to be successful as a leader. 
Previously presented evidence indi- 
cates that the measurement of LGD 
performance is consistent with itself. 
The question still to be answered is 
whether or not it is consistent with 
the various other measurements as- 
sociated with, described as, or de- 
fined as successful leadership be- 
havior. Rated LGD performance 
should be associated with these other 
measures if it is an assessment of 
success as a leader. 

We will now survey empirical in- 
vestigations of the extent to which 
rated LGD performance was found 
associated with status and esteem in 
real-life, personal characteristics and 
leadership performance elsewhere. 


Status (as Estimated by Rank) and 
LGD Performance 


A biserial correlation of .88 was 
found between the rank in the com- 
pany of each of 131 oil refinery super- 
visors and their success as LGD 
leaders among their associates (22). 
The more the discussion problem con- 
cerned matters for which they had 
rank over their associates, the higher 
was this correlation (21). A corre- 
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sponding correlation of .51 was found 
for 264 ROTC cadets. The lower 
correlation in ROTC probably re- 
flected the fact that rank differences 
were less vital to the cadets than to 
the industrial executives. 

When 180 ROTC cadets were re- 
tested among their associates a year 
after an original test, those who rose 
during the year from cadet noncom 
to cadet first lieutenant or higher 
gained significantly more in LGD 
score on the retest compared to the 
test than those who received promo- 
tions to cadet second lieutenant only.* 


Esteem as Estimated by Real-Life 
Merit Ratings and LGD Per- 
formance 


Table 3 lists 17 correlations be- 
tween LGD performance and esteem- 
in-real-life as estimated by merit 
ratings. It also shows when and how 
the ratings of merit were obtained. 
The median correlation is .39 and 
is raised to .51 when only the seven 
cases in which correction was made 
for the unreliability of esteem rat- 
ings are considered. As shown in 
Table 3, LGD scores have been found 
moderately predictive of merit as 
an ROTC cadet officer (15), sorority 
or fraternity member (20, 23, 72), 
civil service administrator (3, 69), 
shipyard foreman (55), foreign serv- 
ice administrator (69), and OCS 
cadet officer (71). The moderate cor- 
relations between LGD scores and 
real-life esteem for the studies that 
involved discussions among strangers 
suggest that a common source of 
variance among examinees, which 
exists beyond the effects of situation, 
underlies an examinee’s merit among 
his real-life associates and his success 


* Bass, B. M., & Coates, C. H. Situa- 
tional and personality factors in leadership in 
ROTC. Unpublished manuscript. 

* See footnote 7. 
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as a leader among strangers, T 
relationship between esteem 
LGD score cannot be attributed sole- 
ly to the tendency of an examinee 
display successful leadership 
associates who esteem him. 
the LGD appears to assess a 
of the examinee that are not sp 
either to the test situation or to t 
group in which he is tested.” 
Variables related to the ce 
between esteem and LGD perfe 
The total amount of successful leat 
ership displayed in a leaderless : 
cussion appears to reflect the ave 
merit as leaders elsewhere of all 
participants of the discussion. 
analysis of 67 LGD's among fri 
nity pledges, sorority members, and 


? Another interpretation of these 
has been offered by E. L. Kelly in 
Annual Review of Psychology (Stanford, 
Annual Reviews, Inc., p. 295). Kelly 5 
that to some extent, the real-life merit 
may react in the same way to the same 
irrelevant to leadership success, as v 
LGD raters. Both assessments are in 
ment, but the source of agreement does not 
necessarily concern leadership potent . 
Thus, in a given situation, real-life merit 
raters and LGD raters both may tend to as 
sign high ratings to thin men, and logic and 
the literature on the subject suggest that 
thinness should have no relation to leadership 

tential. 3 
TN counterargument is as follows: The real- 
life merit ratings, biased as they are, are 3 6 
function of the extent to which the ra 
value or esteem the rates. The eval 
tends to have consequences affecting. the con- 


evaluation. If seemingly irrelevant cues’ pori 
as thinness influence merit ratings in with 
life, they will then tend to be wie 
esteem and leadership potential. d 
to the same biasing cues, the LGD observers 
are in error as far as logic or psychologists d 
concerned, but, despite this, their error 
ciated with real-life leadership potential * 
well as with the biased real-life merit ra 
This same counterargument will not à 
where merit ratings have no future © 
quences on the success of the actual pel 
ance being rated, such as in the case © 
praising diagnostic proficiency of ph: 
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ROTC cadets found a correlation of 
-35 between the mean LGD score per 
discussion group and the mean es- 
teem-in-real-life score of the partici- 
pants of each group. Thus, there 
was a between-groups as well as a 
within-groups positive correlation be- 
tween LGD ratings and esteem as 
estimated by merit ratings. In the 
same way, the LGD scores within a 
discussion correlated .20 with the 
merit scores among the participants 
of that discussion (17). 

These results suggest that LGD 
ratings which depend solely on stand- 
ards within a group discussion should 
suffer in validity as estimated by cor- 
relations with real-life merit ratings. 
Rating techniques with this disad- 
vantage include the forced distribu- 
tion rating, paired comparison, rank- 
ing, and any others which force the 
equalization of all discussion-group 
LGD score means and/or variances. 
Similarly, when tested among strang- 
ers, any ratings of each other by the 
participants themselves will be at- 
tenuated in validity as predictions of 
esteem, since they will depend solely 
upon standards based on observation 
of a single discussion. 

This same analysis (17) indicated 
that a number of variables are associ- 
ated with the variation from discus- 
sion to discussion in the correlation be- 
tween real-life esteem of the members 
and their LGD performance. Accord- 
ing to a Doolittle solution, these 
variables included: within-discussion 
variance in real-life esteem; within- 
discussion variance in LGD ratings; 
and group size. All relationships were 
positive except for size, Six-man 
groups were slightly more valid than 
larger ones as predictors of real-life 
esteem. 

Status differences among members 
almost completely invalidate the 
LGD as an indicator of real-life es- 
teem. Yet above and beyond these 
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effects, the case history discussion 
appears more likely than other types 
of discussions to reflect differences 
in real-life esteem and personality, 
Where members of different rank 
were tested together, the case history 
discussion was the only one which 
yielded scores that correlated posi- 
tively with merit ratings as refinery 
supervisors (r=.28). Furthermore, 
case history discussion performance 
correlated .54 with a supervisory 
aptitude test battery. Other types of 
discussion, in these circumstances, 
averaged .23 in correlation with 
supervisory aptitude test scores, and 
correlated negatively with ratings of 
esteem of these supervisors of differ- 
ent rank (21). 


Personal Characteristics Associated 
with Leadership and LGD Per- 
formance 


According to Stogdill’s survey of 
over one hundred studies, leaders 
tend to surpass nonleaders in certain 
personal characteristics such as capat- 
ity (intelligence, alertness, verbal 
facility, originality, and judgment), 
achievement (scholarship and knowl- 
edge), responsibility and associated 
personality factors (dependability, ini- 
tiative, persistence, aggressiveness, 
self-confidence, desire to excel), and 
participation (activity, sociability, 
cooperation, and adaptability) (63). 
If these characteristics can be in- 
cluded under the broad concept 9 
“abilities to solve group problems, 
then their relationship to leadership 
can be deduced as well (13). Perform- 
ance in the LGD should be associ- 
ated with these personal factors, if it 
is to be judged valid as a measure 9 
individual differences in tendency to 
exhibit successful leadership e: 
havior. Table 4 shows the corre! 
tions obtained between various meas 
ures of capacity and/or achieveme? 
and LGD performance. Table 
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TABLE 4 
CORRELATION BETWEEN LGD PERFORMANCE AND MEASURES OF CAPACITY 
AND ACHIEVEMENT 
Factor(s) Most 
Subjects Test of Capacity or Probably Measured | Corre- 
ees Achievement by Test or lation 
Measurement 
1.140 sorority members (23) | ACE Linguistic Verbal aptitude 
2. 66 fraternity pledges (72) | ACE Linguistic Verbal aptitude 
3. 64 salesand management | OSPE Verbal aptitude 
trainee candidates 
(10) 
4.140 sorority members (23) | ACE Quantitative Numerical aptitude 
5. 66 fraternity pledges (72) | ACE Quantitative Numerical aptitude 
6. 180 ROTC cadets* ACE Total Verbal and numerical 
aptitude (intelli- 
gence) 
7. 202 administrator candi- | South African Air | Intelligence 


dates (68) Force test of men- 
tal alertness 
8. 131 oil refinery supervi- | Otis Gamma Verbal, spatial, numer- 
sors (22) ical, aptitude (intel- 
ligence) 
9. 123 foreign service candi- | Cognitive Test Bat- | Intelligence 
dates (69) tery 
10. 202 administrator candi- | Cognitive Test Bat- | Intelligence .20 
dates (69) tery 
11. 123 foreign service candi- | Civil Service Qualify- | Intelligence and scho- .30 
dates (69) ing Exam lastic achievement 
12. 202 administrator candi- | Civil Service Qualify- | Intelligence. and scho-| .20 
dates (69) ing Exam lastic achievement 
13. 140 sorority members (23) | Years of education Scholastic achievement | .18 
14. 131 oil refinery supervi- | Years of education Intelligence and scho- | .57f 
sors (22) lastic achievement 
15.180 ROTC cadets Average grade in col- | Scholastic achievement | -16 
ege b z 
16. 140 sorority members (23) | Average grade in col- | Scholastic achievement | .31 
ege : 
17. 131 oil refinery supervi- | Supervisory aptitude | Supervisory aptitude :301 
sors (22) test x 
18. 66  mixedcollege students | How Supervise? Supervisory knowledge | ..46 


* Unpublished data. 


T Contaminated by the correlation between rank and test of measurement. 


1 Not affected by holding rank constant. 


shows the correlations between LGD 
Performance and various personality 
Variables that approximate the “‘re- 
Sponsibility" and- “participation” 
clusters of Stogdill. The first three 
items of Table 7 may be regarded as 
Urther evidence of the correlation 
between participation and LGD per- 
.ormance. We shall briefly consider, 
in turn, the correlations between 


LGD performance and capacity, 
achievement, responsibility, and par- 
ticipation. d 
Capacity and. leaderless group dis- 
cussion performance. While the cor- 
relation between LGD performance 
and verbal aptitude averages .30 
or above, as might be expected, the 
correlation between LGD behavior 
and numerical aptitude is below .20. 
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Intelligence test scores measuring 
verbal, numerical, and spatial fac- 
tors tend to fall in between in cor- 
relation with LGD performance. 
Ability to solve the leaderless dis- 
cussion group's problems appears to 
depend more on verbal than on other 
aptitude factors. 

A factor analysis of 14 leadership 
and ability measures by the author 
and Coates'? found that while rated 
performance in initially leaderless dis- 
cussions correlated close to zero with 
one factor, Ability in Active Situa- 
tions, it correlated .44 with another 
factor, Ability in Verbal Situations. 
Sakoda's (62) analysis of OSS data 
also noted that discussion ratings 
fell into a cluster with other ratings 
based on ''verbal" situations. Car- 
ter, Haythorn, and Howell (30) ar- 
rived at the same conclusion follow- 
ing factorial analyses of several cri- 
teria of leadership. These findings 
may indicate the boundaries to the 
range of real-life situations in which 
leadership behavior can be forecast 
by rated LGD performance. 

The 10 available correlations be- 
tween intelligence and associated 
aptitude test scores and success in 
the LGD appear consistent with ex- 
pectations. Yet, verbal aptitude only 
accounts for 10 to 15 per cent of the 
variance in LGD, and so cannot be 
regarded as a more easily adminis- 
tered substitute for predicting success 
as a leader. 

Achievement and leaderless group 
discussion performance. Six studies 
are available of the correlation be- 
tween LGD ratings and tested 
achievement, years of education, or 
grades in college. Again, the correla- 
tions are uniformly positive, ranging 
from .16 to .31, with a median of .25. 
If the unusually high correlation of 
.57 is ignored because it is contami- 


10 See footnote 7. 
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nated by the correlation of rank with 
both education and LGD success, the- 
median becomes .20. 

Above and beyond the effects of 
rank, a correlation of .30 appears to 
exist between LGD performance and 
supervisory aptitude as measured by 
an optimally weighted battery of 
interest, biographical, and super 
visory judgment tests. 

How Supervise?, a fairly widely 
used test of knowledge of principles of 
supervision, which aims to predict 
success as an industrial supervisor, 
correlates .46 with the LGD success 
of 66 college students (64). 

Personality and leaderless group dis- 
cussion performance. We shall now 
consider correlations obtained be 
tween LGD performance and the 
specific personality traits found to be 
associated with leadership according 
to Stogdill. , 

Stogdill cites five studies in which 
leaders are found to be more energetic 
than nonleaders. For the LGD, @ 
correlation of .15 was found between 
general activity and energy as as- 
sessed by the Guilford-Zimmerman 
Temperament Survey for 76 sorority 
girls. A corresponding correlation 
.12 was found for 66 college students 
(64). Also, LGD leaders were more 
often characterized in a Rorschac 
analysis as highly energetic, while 
LGD nonleaders more often were de- 
scribed as lazy or passive (18). 

Stogdill uncovered a large number 
of studies that found successful 
leadership associated with original- 
ity, soundness of judgment, and abil- 
ity to evaluate situations. Rorscha! 
analysis characterized LGD leaders 
as strongly imaginative, strongly 
interested in details, and able to "D 
the larger aspects of things, and LG 
nonleaders as stereotyped or conven- 
tional in thoughts and perceptions, 
as unclear, plodding, and confu 
thinkers, and as unimaginative. 
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study of 172 ROTC cadets found an 
eta of .30 between the F scale of 
authoritarianism and LGD perform- 
ance. Highly authoritarian—hence 
stereotyped and rigid—personalities 
did extremely poorly in the discus- 
sion, while equalitarian—but not too 
equalitarian—cadets earned the high- 
est LGD scores. Among samples of 
100 and 67 of these same cadets, 
Pearson correlations of .32 and .33 
were found between LGD perform- 
ance and  Thurstone's Concealed 
Figures and Gestalt Completion tests 
—tests of perceptual flexibility.™ 
Self-assurance and absence of 
modesty were uniformly associated 
with leadership in 17 studies cited 
by Stogdill. Similarly, self-esteem 
as measured by 140 self-nominations 
for sorority leadership positions cor- 
related .29 with LGD performance. 
An analysis of interviews with nine 
LGD leaders and nine LGD non- 
leaders from a total of 140 subjects 
using the Who-Are-You technique 
(28) showed that compared with non- 
leaders, LGD leaders more frequently 
regard in a more favorable light them- 
selves, their effects on others, and 
other persons’ effects on them. 
Discrepancies between Stogdill’s 
conclusions concerning leadership in 
general and LGD personality corre- 
lations arise when we consider per- 
Sonality characteristics such as emo- 
tional stability, sociability, ascend- 
*ncy, and responsibility. 
Eleven studies have found emo- 
tional stability to be associated with 
leadership, although the evidence is 
not uniformly positive (63). Simi- 
arly, LGD performance correlated 
20 and .17 with emotional stability 
as measured by the Guilford-Zimmer- 
Man inventory (18, 64). Rorschach 
analysis likewise characterized more 
GD leaders than nonleaders as 


" See footnote 7. 
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emotionally stable. Overt social ad- 
justment based on peer ratings cor- 
related .28 with LGD performance, 
On the other hand, five correlations 
found between LGD performance and 
the inventoried traits of freedom 
from hypersensitivity and hostility 
ranged from —.40 to .29 with a medi- 
an of —.04 (18, 64). 

Responsibility was uniformly re- 
ported by Stogdill to be associated 
with leadership. But a correlation 
of —.29 was found for 47 ROTC 
cadets between LGD ratings and re- 
sponsibility as measured by the 
Gordon Personal Profile, while cor- 
relations of .19 and .26 were found 
in two analyses of the relations be- 
tween thoughtfulness as assessed by 
the Guilford-Zimmerman and LGD 
performance (18, 64). 

Evidence concerning the relation 
between extroversion, ascendency, 
and leadership in general is contra- 
dictory according to Stogdill (63). 
Ascendency as measured by the 
Guilford-Zimmerman Temperament 
Survey correlated .44 with LGD 
performance of 76 sorority girls (18) 
and .25 with LGD performance of 
mixed college students (64); but as- 
cendency as measured by the Gordon 
Personal Profile and the A-S Reaction 
Test correlated only .02 and —.02, 
respectively, with LGD performance. 

Sociability as measured by the 
Guilford-Zimmerman in two studies 
(18, 64) correlated .27 and .31, re- 
spectively, with LGD performance; 
similar correlations between cooper- 
ativeness and LGD success were .14 
and .13, while sociability as measured 
by the Gordon Personal Profile cor- 
related close to zero with LGD per- 
formance. 

Finally, while Stogdill found ambi- 
tion and desire to excel important 
attributes of leadership, an average 
correlation of —.05 was found be- 
tween LGD performance and ex- 
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TABLE 5 


CORRELATION BETWEEN LGD PERFORMANCE AND PERSONALITY TESTS OR MEASUREMENTS 


1. 76 
2. 66 
3. 20 
4. 172 
5. 100 
6. 67 
7. 140 
8. 18 
9. 76 
10. 66 
11. 140 
12. 49 
13. 66 
14. 47 
15. 49 
16. 66 
17. 49 
18. 66 
19. 76 


Subjects 


sorority members (18) 
mixed college students 
(64) 


sorority members 
highest and lowest 
on LGD (18) 


ROTC cadets (15). 


ROTC cadets* 
ROTC cadets* 
sorority members (23) 


sorority members 
highest and lowest 
on LGD (18) 

sorority members (18) 


mixed college students 
(64) 


sorority members (23) 
sorority members (18) 


mixed college students 
(64) 


ROTC cadets* 


sorority members (18) 

mixed college students 
(64) 

sorority members (18) 

mixed college students 
(64) 


sorority members (18) 


Personality Test or 
Measurement 


Guilford-Zimmerman 
Temperament Sur- 
vey—G 

Guilford-Zimmerman 
Temperament Sur- 
vey 

Rorschach 


UCPOC F scale 


Concealed Figures 

Gestalt Completion 

Self-nominations for 
leadership positions 
in sorority 

W-A-Y interview 


Guilford-Zimmerman 
Temperament Sur- 
vey—E 

Guilford-Zimmerman 
Temperament Sur- 
vey—E 

Peer ratings 

Guilford-Zimmerman 
Temperament Sur- 
vey—F 

Guilford-Zimmerman 
"Temperament Sur- 


vey— 
Gordon Personal Pro- 
file—H 
Guilford-Zimmerman 
Temperament Sur- 
vey—O 
Guilford-Zimmerman 
Temperament Sur- 


vey 
Guilford-Zimmerman 
‘Temperament Sur- 


Guilford-Zimmerman 
Temperament Sur- 


vey— 
Guilford-Zimmerman 
Temperament Sur- 


Trait Probably 
Measured 


General activity and 


energy 


General activity and 
energy 


Imagination, strongly 
interested in details, 
able to see larger as- 
pects of things vs. 
conventionality, ster- 
eotypy, etc. 

Authoritarianism, ri- 
gidity 

Perceptual flexibility 

Perceptual flexibility 

Self-esteem 


Self-esteem 


Emotional stability 


Emotional stability 


Overt socialadjustment 
Friendliness, agreeable- 
ness 


Friendliness, agreeable- 
ness 


Freedom from hyper- 
sensitivity 

Freedom from hyper- 
sensitivity 


Freedom from hyper- 
sensitivity 
Cooperativeness 


Cooperativeness 


Sociability 


Corre- 
lation 


AS 


12 


.60 


.32 
.33 


14 


27 


vey—S 
Lor = se da ee eS Em 
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TABLE 5— Continued 


Personality Test or 
Measurement 


Trait Probably 


Subjects Miseired 


Guilford-Zimmerman 


20. 66 Mixed college stu- Sociability 1 
dents (64) Temperament Sur- 
vey—S 
21. 47 ROTC cadets* proce Pro- | Sociability .07 
le—! 
22. 47 ROTC cadets* oes Personal Pro- | Responsibility —.29 
le—R 
23. 49 sorority members (18) | Guilford-Zimmerman | Thoughtfulness, re- 19 
Temperament Sur- flectiveness 
vey—T 
24. 66 mixed college students | Guilford-Zimmerman | Thoughtfulness, re- .26 
Temperament Sur- flectiveness 
vey—T 
25. 76 sorority members (18) | Guilford-Zimmerman | Ascendency Ad 
Temperament Sur- 
vey— 
16. 66 mixed college students | Guilford-Zimmerman | Ascendency .25 
(64) Temperament Sur- 
vey—A 
272. 47 ROTC cadets* Gordon Personal Pro- | Ascendency .02 
file—A 
28. 66 mixed college students | A-S Reaction Test Ascendency —.02 
64) 
29. 140 sorority members (23) | Ratings of motiva- Motivation to lead —.05 
tion for various of- 
ices j 
30. 46  uncoached college stu- | Check list descriptions | Parental initiation —.08 
dents* 
31. 30 coached college stu- | Check list descriptions | Parental initiation .19 


dents* 

32. 46  uncoached college stu- 
dents* 

33. 30 coached college stu- 
dents* 

34 39  uncoached college stu- 
ents* 

29 coached college stu- 
ents* 

22 coached college stu- 
dents* 

22 coached college stu- 
dents* 


d 
35, 
d 
36. 
37, 
38, 
39, 
40. 66 mixed college students 


Check list descriptions | Parental consideration —.07 


Parental consideration .07 
Social knowledge .16 
Social knowledge —.19 


Accuracy of estimated | —.07 
self-ratings of others 
Accuracy of estimated .36 
group ratings of self 
Accuracy of estimated 19 
group ratings of 
others 
Masculinity-femininity | —.02 


Check list descriptions 


Kerr Empathy Test 


Kerr Empathy Test 


Dymond Empathy 
Test (modified) 
Dymond Empathy 
Test (modified) 
Dymond Empathy 
dents* Test (modified) 
Guilford-Zimmerman 
Temperament Sur- 


22 coached college stu- 
49 sorority members (18) 
Masculinity-femininity | —.01 

(64) 


* Unpublished carie 
sul curvilinear Va B Maximum LGD success experienced by equalitarian—but not too equali 
o se AE LI eT ek positive. correlation existed between the self-esteem of inter- 


.$ Chi-square analysis s 
Viewees and their LGD scores, 
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pressed desire to hold sorority or 
university student offices (23). 

It may be inferred that some con- 
sistency exists between Stogdill's 
generalizations of the relations be- 
tween leadership in general and such 
personality traits as energy, flexibil- 
ity of judgment, and self-esteem, and 
the relations between these traits and 
LGD performance. Contradiction or 
lack of uniformity appears when we 
consider such traits as responsibility, 
emotional stability, ascendency, and 
sociability." 

Participation and LGD perform- 
ance. According to Stogdill, leaders, 


12 Part of this lack of uniformity may be 
due to variations among techniques used to 
measure the various personality traits, and 
variations in the sex composition of the sam- 
ples studied. 

The responsibility, ascendency, sociability, 
and freedom from hypersensitivity scales of 
the Gordon Personal Profile, a forced-choice 
personality inventory, administered to sam- 
ples of men only, tended to correlate zero or 
negatively with LGD success. Corresponding 
Guilford-Zimmerman scales with the same or 
similar names—of the traditional self-report 
type— tended to correlate positively with the 
LGD success of a sample of women only (18) 
and with the LGD success of mixed men and 
women (64). 

Assuming that Gordon's forced-choice pro- 
cedure is less subject to distortion than the 
Guilford-Zimmerman, and assuming that the 
measurement techniques rather than the sam- 
ples were a significant source of variance, one 
might speculate that LGD performance tends 
to be associated with a participant's concept 
of himself, but only where the participant is 
free to distort the description to suit himself. 

To go one step further, it may be that LGD 
success is related to the way a participant 
likes to see himself, but not to the way he 
actually sees himself when forced to make dis- 
criminations over which he has less control. 

Neither of these self-inventoried evaluations 
actually meets the requirements of the orig- 
inal hypothesis that the more able member is 
more likely to be esteemed and to be success- 
ful as a leader. The crucial evaluations of 
ability for leadership and esteem are those 
based not on self-evaluation, but on other 
group members’ judgments of the participant, 
or on objective tests of personality (as con- 
trasted with inventories), 
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in general, tend to be more talkati! 
more industrious, and more likeli 
participate in group activities, 
same appears to be true of 
scorers on the LGD who, comp 
with low scorers, were found to 
1.5 times as many responses in a 
personal interview, and to give 1.6 
as many responses to the Rorsel 
(18. LGD success also has been 
found, for 140 sorority members, to 
correlate .36 with the number 
university student leadership 
tions held per semester, and 
with the number of sorority po 
held per semester (23). 
Parental leadership and LGD { 
formance. It is expected that chil 
hood experiences and memories 
them should play a significant 
in adolescent or adult leadership 
havior (13). In an unpublished stud 
the author hypothesized that LG! 
performance would be a function 
the participants’ perception of how 
they had been led by their parents. 
Examinees used modified Ohio State 
Leadership Studies behavior 
lists (38) to describe the extent 
which their mothers and fathers 
ated structure for them and were co 
siderate of them. The average 
range of correlations between Sul 
descriptions and LGD performa 
suggested that no consistent 
tionship existed between pare 
descriptions and LGD performa 
Empathy and LGD perfo 
A series of studies (e.g., 33, 67) 
found a moderate relationship 
tween empathic ability and leadi 
ship. The results have not been 
formly positive, mainly because t! 
measure of empathy has varied gre 
ly from one investigation to anoth 
Elsewhere (13), the author has de- 
duced that the more a person Cà! 
accurately estimate the meeds 
others, the more likely he is to! 
cessfully lead others. Chowdry 
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Newcomb (33) have come closest to 
testing this hypothesis, and have 
obtained positive results, 

Kerr and Dymond Empathy test 
data collected on a small sample 
(N's-22 to 39) of students by 
Stolper (64) were correlated by the 
author with LGD performance 
scores, as shown in Table 5. The cor- 
relation of .36 between accuracy of es- 
limations of group ratings of self on the 
Dymond test and LGD performance 
was somewhat artifactual since LGD 
leaders probably led in the Dymond 
test also, and all members of a group 
are likely to agree more closely on 
leaders’ ratings compared to non- 
leaders, according to an earlier study 
by the author (6). 


Leadership Performance in Other 
Quasi-Real Situations and LGD 
Performance 


According to Fig. 1, leadership in 
other quasi-real situations is governed 
by the same individual factors as 
performance in the LGD. Therefore, 
the two should correlate fairly highly 
with each other if successful leader- 
ship is being measured in these other 
situations, and if ratings of LGD per- 
formance are valid as leadership rat- 
ings. Similar deductions can be 
made about LGD performance and 
Success as a leader in real life.” 

Table 6 summarizes the 17 avail- 
able correlations between leadership 
or “desirability for leadership posi- 
tions" ratings based on LGD's and 
other situational tests. The correla- 
tions are almost uniformly highly 
Positive, but many are contaminated 

ause the same raters were used to 
assess candidates during the LGD 
and the other situations. 
; The median correlation between 
interview and LGD ratings is .70. 


? The same alternative interpretation and 
ciu apply here as are presented in foot- 
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Where ratings are made by different 
raters in each situation, the correla- 
tion drops to .45 (9). 

Correlations of .37 and .67 were 
found between ratings based on as- 
signed leadership and leaderless dis- 
cussion (3, 59). Correlations between 
the LGD and other verbal situational 
tests, such as debates and commit- 
tee work, are almost as high as be- 
tween the LGD and its retest, aver- 
aging .70. As the other situations 
involve less discussion and more me- 
chanical or athletic activity, the me- 
dian correlation between the LGD 
and these other situational tests is 
reduced to around .52. 

Leadership ratings based on the 
leaderless group discussion alone 
correlated .64 with the final leader- 
ship ratings based on the entire OSS 
battery of situational tests (59). 


Leadership Performance in Real-Life 
Situations and LGD Performance 


Table 7 summarizes the correla- 
tions between LGD performance and 
real-life leadership performance.“ 
While LGD ratings correlate some- 
what with the tendency to hold lead- 
ership offices, they appear to be 
associated more with the tendency in 
real life to initiate structure and inter- 
action among associates and sub- 
ordinates (r=.32). On the contrary, 
a low negative correlation exists be- 
tween LGD performance and the 
tendency in real life to be considerate 


4 An attempt has been made in this article 
to treat separately from the more numerous 
studies of LGD performance correlated with 
merit ratings of real-life performance (Table 
3), studies of the LGD correlated with objec- 
tive indices of real-life leadership performance 
or fairly nonevaluative descriptions of real-life 
leadership performance. While usually em- 
pirically related, merit rating of performance 
is considered conceptually independent of 
actual performance or descriptions of per- 
formance. 
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TABLE 6 


AND IN THE LGD 


"3 


: Other Quasi-Real 
Subjects Situational Tests 
1. 223-442 OSS applicants (59) Interview 
2. 123 foreign service candi- | Interview 
dates (69) 
3. 202 administrator candi- | Interview 
dates (69) 
4. 64 sales management Interview with dif- 
trainees (9) ferent raters I 
5. 14 shoe factory trainee | Interview and tests | Desirability for job 
executive — candi- 
dates (65) 
6. 223-442 OSS applicants (59) Assigned leadership | Leadership 
7. 219 administrator candi- | Assigned leadership | ‘Personality and 
dates (3) ability" 
8. 223-442 OSS applicants (59) Debate Leadership 
9. 223-442 OSS applicants (59) Brook Leadership 
10. 223-442 OSS applicants (59) Construction Leadership 
1t. 40 NROTC subjects (31) | Leaderless Mechani- | Leadership 
- cal Assembly—4- 
man 
Leaderless Mechani- | Leadership 
cal Assembly—8- 
man 
Leaderless Group Leadership 
Reasoning Task— 
4-man 
Leaderless Group Leadership 
Reasoning Task— 
8-man 
12. 123 foreign service candi- | Other situational Desirability for job 
dates (69) tests such as com- 
: mittee work 
13. 202 administrator candi- | Other situational Desirability for job 
dates (69) tests such as com- 
mittee work 


of the welfare of subordinates and 
associates (r = —.25).15 


Studies of Factors Associated with 
LGD Performance 


Two factorial studies analyzing 
LGD test and retest scores in a cor- 
relation matrix of real-life, situational 
test, and psychological test meas- 
ures are available. The first (23) 
was based on 41 measurements of 
140 sorority girls; the second con- 
cerned 14 measures made of 66 to 244 


35 See footnote 7. 


loadings of the LGD test and re 
on the factors that accounted. 
most of the variance of the LGD. 
In line with propositions 
earlier, ratings based on 
formance appear to assess th 
to which an individual i 
structure or is socially bold 
biguous situations. Esteem, 
sonal worth, and verbal ab 
also involved. On the basis 
studies it may be inferred ! 


16 See footnote 7. ^" 
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TABLE 7 
CORRELATION BETWEEN LGD PERFORMANCE AND REAL-LIFE BEHAVIOR 
= 
Subj Method of 
Subjects Measurement 


1. 140 sorority girls (23) 


Extent of extracurricu- 
lar activities 


Participation in social 
cafe 
Real-life leadership per- 


2, 140 sorority girls (23) Number of university „36 
wr ip positions formance 

3. 140 sorority girls (23) Number of sorority lead- | Real-life leadership per- .10 

i i formance 

4. 140 sorority girls (23) Visibility among associ- 24 
well enough to rate ates 

5, 180 ROTC cadets* Final cadet rank achieved | Future successasalead- | .27f 

6. 133 ROTC cadet officers* | Ratings of subordinates —.28 
and peer associates 

7. 133 ROTC cadet officers* | Ratings of subordinates 2 
and peer associates 

8. 32 sales trainee candi- | Number of leadership 1 


dates (9) 
ously 


* Unpublished data. 
Eta (curvilinear 


positions held previ- 


tionship). 
Significant difference at io). cent level between LGD score of experienced leaders and those without ex- 
perience jers. 


exist three independent sources of 
variance underlying leaderless group 
discussion performance: 
I. Tendency to Initiate Structure 
II. Tendency to Be Esteemed or 
of Value to a Group 
III. Ability in Verbal Situations. 
Little specific variance is left when 
the common variance accounted for 
by these factors is extracted. 


SUMMARY 


The history, applicability, relia- 
bility, and validity of the leaderless 
group discussion as a means of assess- 
ing variations among persons in the 
tendency to exhibit successful leader- 
ship behavior have been considered. 
While the procedure was originated as 
a psychological technique in Ger- 
many over thirty years ago, it is 


TABLE 8 


ORTHOGONAL Factors CORRELATED 


Correla- 


tion i 
Sorori- 
ties 
(23) 


Factor 


wirH OVER-ALL LGD PERFORMANCE 


Correla- 
tion in 


Factor 
ROTC* . 


I. Leadership Potential (Esteem) — .24 | I. Esteem or 
I. renew dics EN .39 Il. Tendency to Initiate Structure .53 
UI. Verbality “31. | IIL Ability in Verbal Situations 44 


* Unpublished data. 
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only in the last decade that system- 
atic reliability and validity studies 
have appeared. 

High interrater agreement and 
high test-retest reliabilities have been 
reported consistently, especially 
where descriptive behavior check 
lists have been used as the rating 
technique. 

Group size, length of testing time, 
type of problem presented, directions, 
seating arrangement, number of rat- 
ers, and rating procedure influence 
to a greater or lesser extent per- 
formance in the LGD as well as the 
reliability and validity of LGD rat- 
ings. Studies of the effects of many 
of these have appeared since 1950. 

According to both deductive and 
inductive evidence, a valid assess- 
ment of the tendency to display 
successful leadership should correlate 
with: (a) status as measured by rank, 
when the assessment is based on per- 
formance among associates of differ- 
ent rank; (b) esteem in real life as esti- 
mated by merit ratings; (c) successful 
leadership performance in other 
quasi-real and real-life situations; (d) 
personal characteristics as measured 
by psychological tests, such as capac- 
ity, proficiency, responsibility, and 
participation. On the whole, ratings 
based on performance in the LGD 
tend to do this. Therefore, it is in- 
ferred that they have some validity 
as assessments of the tendency to 
display successful leadership, i.e., 
leadership potential. 

Other evidence suggests that the 
successful leadership behavior ob- 
served in the LGD concerns primarily 
initiation of structurerather than con- 
sideration of the welfare of others. 

The preceding analysis, coupled 
with recommendations made by 
others working in the field of situa- 
tional tests, such as Weislogel (71), 
leads us to the following hypotheses: 
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1. To maximize the reliability and 
validity of the LGD and other situa- 
tional tests, scoring techniques should 
minimize reliance on the ability of 
observers to infer differences in per- 
sonality traits and future tendencies 
among examinees. Observers should 
merely report or evaluate the im- 
mediate behavior they observe. For 
example, in an unpublished study, 
the author found that two Army 
colonels' estimates of the potential 
as Army officers of ROTC examinees 
were less valid as predictors of the 
merit ratings of the examinees than 
were the colonels' check list descrip- 
tions of who initiated structure dur- 
ing the LGD. Similar results were 
noted in a study of fraternity mem- 
bers reported by the author and 
White (20). 

When the observer makes an infer- 
ence about the future behavior of an 
examinee from the observations of 
the examinee during the LGD, 
several potential errors are likely. 
The observer may err in deciding on 
which dimensions to make inferences; 
he may err in collating his observa- 
tions with the future behavior to be 
predicted; and finally, the dimen- 
sions on which the inferences are 
made may be private ones which 
cannot be shared with other obser- 
vers, The errors may be constant, 
variable, or both. Lack of knowl- 
edge and control over such errors 
disappears when raters are merely 
asked to describe what they observed 
and these descriptions are used as 
predictors. 

Further reduction of uncontrolled 
raters’ errors may be made in the 
following ways: vj 

a. Objective criteria for describing 
specific behaviors can be used e 
In the LGD, the actual number 0 
times a participant suggests a neW 
approach to a problem can be not 
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instead of rating “to what extent did 
the participant suggest new ap- 
proaches to problems.” 

b. Forced-choice check lists can 
be used instead of present check lists. 
Otis’ (60) recent successful applica- 
tion of the forced-choice technique to 
interviewer ratings indicates promise 
for applying the same procedure to 
the LGD. 

2. To maximize validity, problems 
that are equally ambiguous to all 
participants, and that require the 
initiation of structure for their solu- 
tion, should be used. Where interest 
is in forecasting leader behavior in 
real life, the structure to be set up 
should approximate the real-life set- 
ting as much as possible. 

3. Since the LGD correlates fairly 
highly with most other intellectual 
or verbal situational tests, the use of 
many situational tests in a battery to 
forecast leadership potential is of 
doubtful utility. Thus, leadership 
tatings based on a one-hour LGD cor- 
related above .60 with leadership 
assessments based on the three days 
of OSS situational testing (59). A 
similar correlation between the LGD 
and an entire battery of situational 
tests was found by Vernon (69). 
However, a significant proportion of 
the variance in over-all potential as 
à successful leader, unaccounted for 
by LGD, may be predicted by a fairly 
Pure active or mechanical, initially 
leaderless, situational test which min- 
Imizes variance due to verbal ability. 

4. Compared to paper-and-pencil 
techniques, the LGD is expensive; 
Compared to the individual interview, 
in many locales, it may prove eco- 
nomical. The LGD appears feasible 
administratively, especially in mili- 
tary programs screening OCS or ad- 
vanced ROTC applicants, in civil 
Service examinations, in screening 
College seniors who are to be assessed 
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at their colleges for management 
trainee positions, and anywhere else 
where "boards" have been used 
traditionally, such as in the selection 
of public school teachers. 

5. While the LGD appears to have 
some validity as a predictor of the 
tendency to be a successful leader in 
a number of situations, especially in 
comparison to other assessment tech- 
niques, tailor-made batteries of paper- 
and-pencil tests will undoubtedly 
yield higher validities in designated 
situations. However, it may be that 
just as the brief intelligence test 
is applicable for predicting train- 
ability for many skilled occupa- 
tions, so the LGD will provide a 
general technique for partially as- 
sessing potential success as a leader 
in a relatively wide range of situa- 
tions. 

6. A number of situations in which 
the LGD is less likely to be success- 
ful may include the following: 

a. The LGD is less likely to be 
valid for measuring or forecasting 
esteem or leadership potential when 
examinees can be tested only among 
others of different rank. In such a 
case, status—and not esteem or per- 
sonality—will determine who suc- 
ceeds in the LGD. 

b. The LGD is less likely to be 
useful where factors peculiar to the 
situation block initiation of structure 
where no structure exists. Conceiv- 
ably, in certain military settings for 
example, the examinees may be im- 
bued with the dictum “never volun- 
teer for anything." However, exactly 
how this would affect LGD validities 
is unknown. 

c. Another unknown is the effect 
of the average verbal aptitude and 
educational status of the participants 
on the validity and utility of the 
LGD. It is expected that where this 
mean falls below a certain minimum, 
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LGD forecasting efficiency may suf- 
fer." 

d. Since achievement and intelli- 
gence appear to correlate with LGD 
performance as well as with success 
as a leader in real life, any restriction 
in the range of intelligence or achieve- 
ment of LGD participants would be 
likely to reduce the forecasting effi- 
ciency of the LGD. Conversely, the 
greater the variance in intelligence 


1. 


10. 


11. 


and achievement of participants, the 
more likely is the LGD to accurately 
assess leadership potential. 


e. Finally, the less the leadership 


situation for which we are selecting 
leaders requires verbal communica- 
tion or verbal problem solving, the 
less likely is the LGD to be useful 


as 


a measure or predictor of the 


tendency to exhibit successful leader- 
ship. 
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THE ATTENUATION PARADOX IN TEST THEORY! 
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Tue PARADOX 


In recent years several authors 
have called attention to a paradox in 
test theory. Gulliksen (7) appears to 
have been the first. He showed that, 
under reasonable conditions, ‘In 
order to maximize the reliability and 
variance of the test, items should 
have high intercorrelations, all items 
should be of the same difficulty level, 
and this level should be as near to 
50% as possible." But, he continued, 
"The criterion of maximizing test 
variance cannot be pushed to ex- 
tremes. Test variance is a maximum 
if half of the population makes zero 
scores, and the other half makes per- 
fect scores. Such a score distribution 
is not desirable for obvious reasons, 
yet current test theory provides no 
rationale for rejecting such a score 
distribution" (7, pp. 90-91). 

All studies to be reviewed in this 
paper assume homogeneous tests, in 
the sense that all correlations be- 
tween items within a test are ac- 
counted for by a single common fac- 
tor. Validity, as used herein, refers to 
correlation of the test with that com- 
mon factor. While the studies are 
explicitly concerned with reliability, 
in this context reliability refers only 
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to degree of homogeneity. There is 
no concern with stability of functions 
over time, 

Tucker stated the paradox as fol- 
lows: "Consider the case when all 
the items in a test are equivalent; 
that is, when the items all measure 
the same trait, have equal reliabili- 
ties, and are equally difficult. In this 
case the items are equally intercor- 
related with coefficients equal to the 
item reliabilities.... If the relia- 
bility of the items were increased to 
unity, all correlations between the 
items would also become unity and a 
person passing one item would pass 
all items and another failing one item 
would fail all items. Thus the only 
possible scores are a perfect one or one 
of zero” (19, pp. 1-2). He pointed 
out, as a consequence of this paradox, 
that under these circumstances in- 
creasing the reliability of a test be- 
yond a certain point will decrease the 
validity of the test, in contradiction 
to the usual belief, embodied in the 
correction for attenuation, that in- 
creasing the reliability of the test al- 
ways increases its validity. 

Brogden conceived of the problem 
as oneof “determining the distribution 
of item difficulties which will maxi- 
mize the correlation of the test with a 
perfect measure of the characteristic 
the test is intended to measure” 
(1, p. 197). With perfectly valid 
items, he pointed out, the difficulties 
should be equally spaced in some 
sense, whereas with items that do not 
intercorrelate, the difficulties should 
all be as close to .5 as possible. Brog- 
den stated that item selection pro- 
cedures aimed solely at increasing the 
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reliability of the test may result in a 
decrease in validity, again, the “at- 
tenuation paradox." 

Two writers who missed the at- 
tenuation paradox were Loevinger 
(11) and Cronbach (3). Loevinger 
called attention to the anomalous 
result of using equivalent items when 
the item intercorrelations approached 
unity and concluded that equivalent 
items were undesirable in the usual 
case. Since the phi coefficient and 
KR 20 (Kuder and Richardson's [10] 
formula 20) measure the departure of 
the items from equivalence as well 
as the interrelationships of the under- 
lying variables, she concluded that 
they were inappropriate as item- 
selection coefficients. 

Cronbach, in refuting Loevinger, 
stated: “The phi coefficient which 
tells when items do and do not dupli- 
cate each other is a better index 
just because it does not reach unity 
for items of unequal difficulty" (3, 
p. 329). Here Cronbach neglected 
the fact that if one uses the phi 
coefficient for item selection, one 
needs two rules: For lower values of 
phi, the higher the coefficient, the 
more will the two items contribute 
to the validity of the test. But for 
high values of phi, the lower the 
coefficient, the more will the two 
items contribute to the validity of the 
test. 

Similarly, Cronbach showed that 
the maximum value of KR 20 is not 
much less than unity for items with 
a specified distribution of item diffi- 
culties, and that the maximum value 
will drop for a greater range of item 
difficulties. If, however, maximizing 
KR 20 is made a rule for item selec- 
tion, as Cronbach recommends, there 
will be a tendency to select items with 
a narrower range of difficulty, or, 
in Cronbach’s terms, more redundant 
items. Here Cronbach apparently 


JANE LOEVINGER 


failed to see the paradox that where- 
as maximizing KR 20 for constant 
number of items will lead to increas- 
ing the validity of tests where the 
item intercorrelations are low, it will 
lead to decreasing the validity of 
tests where the item intercorrelations 
are high. 

The resolution of the attenuation 
paradox thus lies in having two rules 
for test construction. For the “classi- 
cal region,” the region in which the 
attenuation of validity decreases 
with increase in reliability, the closer 
the items are to difficulty of .5 and 
thus to equivalence, the more reliable 
and more valid will the test be. For 
the “region of paradox” the optimal 
distribution of item difficulties must 
be determined as a function of item 
intercorrelations. This solution was 
implied by Brogden and stated by 
Davis (5). 


DEFINITION OF THE REGION 
OF PARADOX 


Four studies have contributed to 
the definition of the region of para- 
dox, those of Tucker (19), Brogden 
(1), Davis (5), and Cronbach and 
Warrington (4). . 

Tucker (19) assumed that ability 
or true score is normally distributed, 
that probability of success on amy 
item is related to true score by the 
normal ogive, and that all items are 
equally difficult. He found that vith 
median equivalent items, i.e., equiva 
lent items of difficulty equal to > 
the validity of the test constantly 1- 
creased as the item reliability mM- 
creased for a one-item test. For a! 
item test optimal item reliability 
(interitem correlation) was about 3 
and for a 100-item test optimal item 
reliability was about .25. The maxi 
mum validity was .8 for a one-item — 
test or for a test with perfectly rei 
able items, however many, since 
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E case is equivalent to that of 


one item. Maximum validity was .97 
with 100 items and slightly over .9 
with 10 items. 

Tucker investigated a single case 
of non-median equivalent items. All 
items were of such difficulty that the 
ability level where the probability of 
passing is .5 was one standard devia- 
tion above the mean. (It will be 
convenient to refer to this case as 
tests for which z—1. Results areiden- 
tical for s=—1, that is, for tests 
where all items are of such difficulty 
that the ability level where half the 
individuals pass the item is one 
standard deviation below the mean.) 
In this case the optimal interitem 
correlation for a 10-item test was 
about .3, for a 100-item test about 
4$. The corresponding validities 
were about .83 and .96. For a one- 
item test the validity again increased 
às a function of item reliability to a 
maximum value of about .66. This 
value is also the terminal validity 
for a test of any number of items as 
the item reliability approaches unity, 
given that all items are of the speci- 
fied difficulty level. 

The optimal values of item reli- 
abilities under his conditions were 
surprisingly low, Tucker pointed out; 

owever, he cautioned that exceeding 

ese values caused less decrement in 
validity than falling short of them. 

urther, where item difficulties are 
Dot all equal, his results do not hold 
exactly, 

Brogden ( 1) assumed that the true 
Sore of ability was normally dis- 
tributed, that the tetrachoric cor- 
relations of all pairs of items within 
à test were equal, and the biserial r 
9! each item with true score was the 
Same for all items in a test. Phi co- 

cients between items and point 

Iserials between items and true score 
Were not always constant within a 
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test because items were permitted to 
differ in difficulty. Validity coefficient 
of the test was studied as a function 
of tetrachoric r of items, number of 
items, and distribution of item diffi- 
culties, Item inter-r's had the values 
.2, 4, .6, and .8. Numbers of items 
were 9, 18, 45, 90, and 153. All item 
difficulties were concentrated at half 
sigma units between and including 2.0 
and —2.0. The types of distributions 
were rectilinear, normal, skewed, and 
constant at each of the sigma values. 

Some of Brogden's results are dis- 
played in Fig. 1, 2, and 3. Figure 1 
shows validity as a function of item 
intercorrelation for tests composed of 
median equivalent items, that is, 
items for which z 0. The four curves 
correspond to four values for n, the 
number of items. Figure 2 is similar 
to Fig. 1, except that all items are 
of difficulty s=1. (The curves of 
Fig. 2 also apply if all items are of 
difficulty z- —1. While Fig. 2 
again shows the attenuation paradox 
as a function of number of items, 
comparison of Fig. 1 and 2 shows the 
attenuation paradox as a function of 
item difficulty. Figure 3 shows valid- 
ity as a function of distribution of 
item difficulties for tests composed of 
90 items. The curve for s=0 is also 
a member of another family of curves 
shown in Fig. 3. These curves show 
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what happens to a test of 90 equiva- 
lent items as the difficulty departs 
from median value. The data for 
these graphs are taken from Brog- 
den's Tables 2 and 3. 

Reading across the three figures, 
one sees that, in general, validity 
increases with increasing item inter-r 
up to a point and then decreases. 
The only curve which continues to 
rise in these figures is that for rec- 
tangular distribution of item diffi- 
culties. For a large number of non- 
median equivalent items, the optimal 
item inter-r is apparently less than 
.2. Paradox is exhibited whenever 
validity decreases as reliability (item 
inter-r) increases, that is, whenever 
the curve slopes downward. Brog- 
den's method, unlike Tucker's, does 
not permit ascertaining exactly the 
optimal value of item inter-r, so the 
computed points have been con- 
nected by straight lines rather than 
attempting to sketch a curve. Tuck- 
er's article contains actual curves 
which may be compared with the 
figures here. 

Reading up and down Fig. 1 and 
2, one sees that validity always in- 
creases with increasing number of 
items, provided item inter-r and dif- 
ficulty distribution are held constant; 
however, the increase in validity ob- 
tained by increasing n becomes small- 
er as the item inter-r increases. 
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Figures 1 and 2 show that op 
item inter-r decreases as s increa 
for constant item difficulty; in ¢ 
words, the region of paradox im 
creases as x increases. Comp 
of Fig. 1 and 2 illustrates that val 
ity drops as item difficulty de 
from the median value for constant 
n and item inter-r. The same com 
parison illustrates that for tests com- 
posed of equivalent items, the | 
of paradox increases as the item 
culty departs from the 
(Support for these generalizations 
contained in further data not repr 
duced here.) ER 

The two highest curves of F - 1 
show an important manifestation of 
the attenuation paradox: Median 
equivalent items produce tests c 
higher validity for low values 
item inter-r, while a rec j 
tribution of difficulty produces tests 
of higher validity for high values off 
item inter-r. The curve for normat 
distribution of item difficulties 
between that for rectangular dis 
bution and that for s=0 but w 
omitted to make the figure more teg 
ble. The point at which rectangu* 
distribution produces higher 
than median equivalent items 
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Fic. 3. ATTENUATION PARADOX AS 
TION OF LEVEL AND DISTRIBUTION O 
DIFFICULTIES FOR 90-Irem Tests. DATA! 
BnocpEN's (1) TABLES 2 AND 3. 
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be taken as defining the limits of the 
region of paradox. Utilizing data 
from Brogden's Table 2 shows the 
following: Higher validity was ob- 
tained by tests of median equivalent 
items than by tests with distributed 
difficulties for item inter-7's of .2 and 
A; however, for r of .4 and large num- 
ber of items (90 or 153), there was 
almost no difference in validity for 
tests with median equivalent items 
and tests with item difficulties dis- 
tributed normally or rectilinearly. 
For item inter-r equal to .6, the test 
with median equivalent items was 
superior for a 9-item test, ran about 
equal with tests of rectilinear and 
normal distribution of item difficulties 
for 18- and 45-item tests, and ran 
slightly behind those with distributed 
difficulties for 90 and 153 items. For 
item inter-r's equal to .8, distributed 
difficulties were clearly superior to 
equivalent items, more so as the num- 
ber of items increased. The rectilin- 
ear had a slight advantage over the 
normal distribution for tests with 
more than 18 items. 

The curve in Fig. 3 for skewed dis- 
tribution of item difficulties falls far 
below the curves for rectangular, 
normal, and median equivalent items. 
This decrement is chiefly a result not 
of skewness but of the departure of 
mean item difficulty from z—0. The 
rectangular and normal distributions 
do have means at z—0, whereas for 
the skewed distribution the mean 
Erud by the reviewer) is at 

For a given item inter-r, where 
tests are composed of items of con- 
stant difficulty, the validity falls off 
fairly sharply with the departure of 
that difficulty from the median; more- 
Over, the greater the departure of the 
difficulty from the median, the lower 
the optimal value of the item inter-r. 

omparing now the curve for skewed 
difficulties, which corresponds to a 
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mean z* 1.21, with tests of constant 
difficulty equal to 1 and to 1.5, one 
obtains at least a hint that distribu- 
tion of item difficulties protects the 
test from decrement of validity due to 
departure of difficulty from the medi- 
an, at least for moderate and high 
values of item inter-r. More striking 
is the fact that despite the distribu- 
tion of item difficulties, validity con- 
stantly falls with increasing item 
inter-r within the range considered, 
when the mean difficulty of the items 
is not appropriate for the group tested 
and the number of items is large. 
Practical considerations often lead 
to the use of a test with a group whose 
mean ability is not exactly the same 
as the mean difficulty of the items. 
Thus this important effect deserves 
much more extended investigation. 

Davis (5) assumed that the most 
desirable distribution of test scores 
is a rectangular one and sought the 
conditions under which such a dis- 
tribution could be obtained. He con- 
cluded that where the tetrachoric 
item inter-7's are .5 or less, the clos- 
est approach to a rectangular dis- 
tribution will be obtained with medi- 
an equivalent items. As the item 
inter-r rises above .5, the item diffi- 
culties should be increasingly dis- 

In comparing Davis' result 

with those of Tucker and Brogden, 
note that the latter investigators as- 
sumed that the true scores were nor- 
mally distributed. Validity coeffi- 
cients thus could equal unity only if 
obtained scores also were normally 
distributed. In effect, then, Tucker 
and Brogden assumed a normal dis- 
tribution as a desideratum while 
Davis assumed a rectangular distri- 
bution.* 

Cronbach and Warrington (4) as- 
sumed that the probability of success 

3 Davis' derivation, obtained from him in 
mimeographed form, appears to be somewhat 
less rigorous than the other two. 
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on any item is described by the nor- 
mal ogive; however, in contrast to 
the preceding studies, in which per- 
sons with no ability were assumed to 
have zero probability of passing the 
item, they dealt only with three- 
choice items. Thus persons with no 
ability had a 4 chance of passing 
any item. The standard deviation of 
the ogive, ou, was the same for all 
items within a test. Most of the tests 
had 30 items. The underlying ability 
was assumed to be normally distrib- 
uted. The scale value of each item 
is that level of ability for which the 
probability of passing the item is $ 
(or, if corrected for chance, 4). 
Items were located at scale values 
between 2.5 and —2.0, inclusive, at 
intervals of .5. Pattern A had 30 
items at scale value 0; pattern B 
had 6 at .5, 18 at 0, and 6 at —.5; 
pattern C had 6 each at 1.0 to —1.0 
inclusive; and pattern D had 3 each 
at 2.5 to —2.0 inclusive. Cronbach 
and Warrington's chief problem was 
outside the realm of the attenuation 
paradox. They were concerned with 
the screening efficiency of the tests 
Íor various possible cutting scores 
from close to zero to over 90%. 
Screening efficiency was measured by 
fw, of the dichotomized score scale 
against the continuous, normally dis- 
tributed scale of ability. For each 
test a series of such validity coeffi- 
cients was obtained and its curve 
plotted. 

For perfectly precise items (40) 
the validity of pattern A was great- 
est only in the neighborhood of a 
cutting score of .5, and fell sharply 
below that of other patterns as the 
cutting score deviated from the medi- 
an point. The range of cutting scores 
for which pattern A was superior 
increased as precision decreased un- 
til, for values of c;Z 1, pattern A 
was superior throughout most of the 
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range. A somewhat similai 
curves was generated by he 
tern constant and letting” 
cision vary; that is, the gn 
item precision, the more valid 
for cutting scores near the- 
but the less valid for extreme 
scores. This tendency was par 
ly clear for pattern A, whi 
called a “peaked” test, 
manifestation of the attenuati 
adox. E 
Cronbach and Warrington 
posed an interesting integration 
their findings. They sugg 
increasing the variance of the 
values of the items (¢,7) has about 
same effect as increasing the 1 
of the item ogive (e£), and that t 
important quantity to be co 
is the sum of the two Va 
Using eta rather than r as a 
of over-all validity, they fi 
under their conditions va 
creased as a function of oat 
a maximum where the sum 
variances was about .5, then | 
declined. im 
At one point they stated, 
as we can judge from these and Ti 
er's results, the peaked test has 
rior validity for even lower val 
oz (higher r;j) when the test is 
than thirty items" (4, p. 146) 
bach and Warrington's data 
effect of test length are scan! 
the data of Tucker and Brogden 
clear, and opposite to wh: 
bach and Warrington indicate: 
longer the test, the greater the 
of paradox. (See Fig. 1 an t 
An incidental finding of € 
and Warrington's study was 
peaked tests tended to ha 
or markedly skewed dist 
while the tests with dispe 
ficulties had more nearly no 
tributions. They did not 
the possibility that this 
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itselí a manifestation of the attenua- 
tion paradox. As reason for using 
ts they stated, "Unlike product- 
moment r, few is independent of the 
test score metric. Our results are 
therefore invariant as test scores are 
transíormed to other scales" (4, p. 
132. But, one may ask, what opera- 
tions correspond to transforming test 
wores to other scales? Given their 
conditions, only their scale results. 
The effect of various changes in 
conditions would have to be investi- 
gated. Indeed, it is generally con- 
sidered a requirement for use of ria 
that the dichotomized variable be 
normally distributed. The effect of 
using ni and eta as opposed to r 
apparently is to increase the ad- 
vantage of peaked as opposed to 
distributed item difficulties, and thus 
in effect to underestimate the region 
of paradox. 

In summary, the region of paradox 
is most clearly defined in the studies 
of Brogden and Tucker. Neither 
Davis’ study nor that of Cronbach 
and Warrington detected that the 
region of paradox increases with the 
number of test items, an effect clearly 
discernible in the two previous stud- 
is. If Cronbach and Warrington 
had used the product-moment r in- 
stead of eta they apparently would 
have obtained results comparable to 
those of Brogden and Tucker. Cron- 
bach and Warrington called attention 
to the importance of considering 
items where there is a nonnegligible 
probability of success without abil- 
ity. Their method appears better 
adapted to further exploration of this 
Problem than other methods, pref- 
erably with r used as evaluation 
function. A valuable extension of 
their study would include two-choice 
ànd four-choice items as well as ones 
With no probability of success without 
ability. Brogden's method has pro- 


concerning the limits of the region 
paradox and is well adapted to further 
investigation of the relative merits 
of concentrated and distributed difi- 
culties under circumstances where a 
test may be used for groups differing 
in mean ability. Cronbach and 
arrington's method could also be 


paper, being analytic, 
prove the most substantial contribu- 
tion to test theory of any reviewed 
herein. 


APPLICATION TO TEST 
CONSTRUCTION 


The statistical phenomena sum- 
marized by the phrase ‘‘the attenua- 
tion paradox” reveal the importance 
of the mean and variability of the 
standardization sample in the evalua- 
tion of a test's validity. Reading up 
and down Fig. 3, one can interpret 
the curves for differing values of s 
as showing a striking decrement in 
validity for tests composed of median 
equivalent items when they are 
subsequently applied to groups hav- 
ing a different mean on the trait 
measured. Similarly, the validity co- 
efficients obtained for a hypothetical 
set of tests with decreasing disper- 
sion of item difficulties corre 
to those obtained by rit the 
items as constant but administered 
to groups of increasing variability. 
Generally speaking, the standardiza- 
tion sample should have the same 
mean and variance as the group on 
which the test will ultimately be 
used. In practical situations usually 
no such identity can be guaranteed, 
certainly not for the life expectancy 
of a well-constructed test. | 

For tests in the low homogeneity 
or “classical region,” as the variance 
of the sample increases, mean held 
constant, the validity coefficient in- 
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creases. Thus, to obtain a lower 
bound to the validity coefficient, the 
standardization sample should be at 
most as variable as any sample on 
which the test will be used. 

For tests in the “region of para- 
dox” the situation is more compli- 
cated. For a given degree of item 
intercorrelation and a given number 
of items, there is probably an optimal 
distribution of item difficulties. The 
higher the item inter-r, the more it 
is true that each item added to the 
test adds validity only if it differen- 
tiates at a different level of difficulty 
than those items already included. 
Any considerable increase in the vari- 
ability of the group will decrease the 
validity of the test. Thus, to obtain 
a lower bound for the validity co- 
efficient, the standardization sample 
should be at least as variable as any 
sample on which the test will be used. 

Variation in the mean of the appli- 
cation sample from that of the stand- 
ardization group must also be con- 
sidered. Cronbach and Warrington 
concluded that where general use- 
fulness over a period of time is sought 
items should be concentrated around 
median difficulty; however, they con- 
sidered tests as selective instruments, 
which is legitimate but different from 
the present consideration of tests as 
measuring instruments, and their use 
of biserial correlation is question- 
able. Whether concentration of item 
difficulties makes a test more sus- 
ceptible to loss of validity in applica- 
tion to groups of different means is 
a point on which evidence is not 
yet available. 

Theoretical and practical consider- 
ations may be drawn together as 
follows: When application to a single 
group is considered, concentration of 
item difficulties is called for in the 
case of item inter-r’s usually met, 
which are low. Consideration of use 


JANE LOEVINGER 


of tests for differing groups will lead 
to dispersion of item difficulties for 
some cases, i.e., where item inter-r's 
are not too low. In practice, however, 
it is usually not possible to find large 
numbers of items exactly at the 50-50 
point. The consequent inevitable dis- 
persion of difficulties is in many 
cases probably desirable. A method 
of item selection that favors median 
items but does not exclude good items 
with more extreme difficulties seems 
indicated by these considerations. 
One such method has been proposed 
recently (13), and no doubt other 
methods have this property. That 
one widely used method of item 
selection has a very different effect 
will be shown in the next section. 


IMPLICATIONS FOR TEST THEORY 


Consider the definition: A para- 
doxical property of a test is a property 
such that the validity of the test 1s 
not a monotonic function of that 
property; ie., validity is sometimes 
an increasing and sometimes a de- 
creasing function of the property. | 

The objections to taking validity 
as the focal concept for test theory 
are well known. Probably all psycho- 
metricians would agree that test 
theory must have basic concepts 
which refer to intrinsic properties of 
tests. Is it not intuitively valid, 
however, to demand that the most 
basic concept of psychometrics shall 
be a nonparadoxical property of tests? 
Are there such nonparadoxical proP- 
erties? i 

Clearly reliability, or one of it$ 
cognate concepts, is the focus 9 
present-day test theory; the ste m 
cal theory of reliability is the bulk o 
classical test theory. There x 
been many criticisms of the reliabi- 
ity concept; those by Thorndike (17); 
Cronbach (2), and Loevinger \ 
have a good deal in common, particu" 
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larly the distinction between stabil- 
ity and homogeneity, which is lost in 
the ordinary usage of "reliability." 
The purpose of the present article 
is not, however, to review all the 
objections to the reliability concept 
but to draw attention to a single in- 
stance in which the statistical theory 
of reliability leads to self-contradic- 
tion. 

Such solutions and explanations of 
the attenuation paradox as have been 
proposed, those of Brogden, Davis, 
and Cronbach and Warrington, are 
appeals to common sense completely 
outside traditional theory of relia- 
bility. Perhaps the most paradoxi- 
cal aspect of the attenuation paradox 
is that Gulliksen, who appears to 
deserve credit for discovering it, 
failed to include any reference to it 
in his comprehensive summary (8) 
of mental test theory. In his own 
words, “Current test theory provides 
no rationale for rejecting" the anom- 
alous score distribution. 

Lord (14) has probably come clos- 
est to integrating the attenuation 
paradox with classical test theory. 
He points out that while Pearsonian 
correlation between test score and 
ability decreases in what is here 
called the region of paradox, the 
curvilinear correlation with ability 
constantly increases with increasing 
item inter-r. Unfortunately, however, 
consideration of curvilinear correla- 
tion obscures the paradox, which 
does in fact exist. Lord concerns 
himself chiefly with the classical re- 
gion; it is not clear that his approach 
could lead to the distinction between 
the two regions of test theory. 

. Reliability is paradoxical; satura- 
tion, a concept imbedded in a method 
of test construction recently proposed 
by Loevinger, Gleser, and DuBois 
(13), is virtually identical with one 
of the Kuder-Richardson (10) co- 
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efficients and thus is paradoxical in 
the same region. Guttman's (9) con- 
cept of scalability and Loevinger's 
(11) concept of homogeneity clearly 
differ from reliability and saturation. 
Are these properties paradoxical? 
Consideration of the results quoted 
in the second section of the present 
paper shows that these properties are 
also paradoxical, not, however, in 
what is here called the region of 
paradox but in the classical region. 
To distinguish the two kinds of para- 
doxical properties, scalability and 
homogeneity may be called neo- 
paradoxical properties. Only scale 
theory will be considered. Its im- 
portance is pointed up by the fact 
that virtually all recent contributions 
to test theory in sociology and closely 
related disciplines are phrased in 
terms of scale analysis. 

Guttman originally measured scal- 
ability in terms of a “coefficient of 
reproducibility," which is the pro- 
portion of the responses of the group 
which can be reproduced from knowl- 
edge of item difficulties and total 
scores. Guttman distinguished 
"scales" from "'quasi-scales" accord- 
ing to whether the coefficient of 
reproducibility exceeded .9, assuming 
certain other conditions were satis- 
fied. Festinger (6), Loevinger (12), 
and probably others criticized the 
distinction between scales and quasi- 
scales as arbitrary. According to 
these writers, Guttman and his fol- 
lowers were erecting à qualitative 
difference out of a purely quantitative 
one. The considerations of the pres- 
ent paper, however, justify what was 
apparently a purely intuitive dis- 
tinction on the part of Guttman. 
When we deal with cumulative (12) 
dichotomous items, which is the usual 
case (16), the distinction between the 
two regions of test theory can be 
clearly, if laboriously, drawn. What 
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Guttman called quasi-scales are tests 
in what is here called the classical 
region; what Guttman called scales 
are tests in what is here called the 
region of paradox. There is no evi- 
dence, however, that workers in the 
field of scale analysis are aware of 
the precision with which this dis- 
tinction can be made, nor of the con- 
sequences of the distinction which 
have been elaborated by psycholo- 
gists working in test. theory. 

A number of objections have been 
raised against the original coefficient 
of reproducibility, a major one being 
that its lower limit can be arbitrarily 
raised by selecting items extreme with 
respect to difficulty (or popularity). 
In practice, workers in this field have 
probably always more or less taken 
this fact into consideration. A recent 
paper by Menzel (15) has proposed a 
modification of the coefficient of re- 
producibility which eliminates this 
problem. Clearly, insofar as use of 
Guttman's coefficient led to the selec- 
tion of items extreme in difficulty as 
opposed to median items, it led to a 
decrease in validity of the resultant 
test, since under none of the condi- 
tions investigated above did a selec- 
tion of extreme rather than median 
items lead to an increase in validity. 

Stouffer, Borgatta, Hays, and 
Henry (16) report that, in practice, 
in order to derive scales rather than 
quasi-scales from available data, it 
has been necessary to select one from 
among several apparently equally 
good items at any given difficulty 
level. According to them, probably 
the most common method of im- 
proving scalability has been reduction 
of number of items per test, often to 
no more than four or five items. 
Inspection of Fig. 1 and 2 and other 
available data reveals no set of con- 
ditions under which reduction in 
number of items increases validity; 
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on the contrary, increase ii 
of items invariably in "s 
most markedly when the mui 
items is small. 

A further consequence of 
concept of perfect scales as 
sideratum in test construc 
been that test constructors 


or less evenly spread in dif 
Such selection will tend to 
scalability, but in the classi 
it will tend to decrease valid 

In summary, Guttman's- 
cient of reproducibility gives 
tage to extreme as opposed to 
items. If this coefficient is 
so as to give advantage mei 
extreme nor to median items, 
of scalability still leads to d 
in number of items and to di 
of item difficulties. But the 
cited in this review is that d 
in number of items always | 
decrease in validity, other 
equal; there is no set of 
under which extreme items le 
more valid tests than median 
and in the classical region, d 
of item difficulties leads to d 
in validity. There is reason 
lieve that most of the attitude te 
currently in use fall in the classic 
region. One may conclude the 
tensive use of scale analysis. 
most certainly led to loss of và 
in tests used in sociology. ^ — 

In recognition of the diffic 
to which scale analysis has led 
or less akin to those cited here; 
fer, et al. (16) have proposed a. 
cation of the method. They 
to have about five items pel 
but each item is a “contrived 
A contrived item is a comp 
several items similar in level 
culty, but each contrived | 
scored zero or one, dependi 
number of pluses in the i 
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which it is composed. The contrived 
items are constructed by a cut-and- 
try method, The authors show that 
tests constructed by their method are 
superior to those constructed by the 
more common method of scale analy- 
sis with respect to scalability and 
some definition of reliability, but 
only where the original items are 
quite good. Their method appears 
to be a compromise between scale 
analysis and methods favoring choice 
of equivalent items, to which most 
psychologists would lean. While the 
new method may ameliorate the 
flaws of scale analysis, it does not 
solve the conceptual difficulty. Scal- 
ability, like reliability, is a paradoxi- 
cal concept. Worse, scalability is 
paradoxical in the classical region, 
which is heavily populated with 
tests, while coefficients such as the 
Kuder-Richardson (10) reliability co- 
efficients are paradoxical only in the 
region of paradox, which is thinly 
populated with tests.‘ 

The problem of finding nonpara- 
doxical properties of tests remains. 
A property different from those dis- 
cussed above which may qualify is 
the discriminating power of the test. 
Many writers have used this term 
without definition, as if it were self- 
explanatory. Several recent papers 
have provided indices of discriminat- 


* Dr. Ledyard Tucker has, however, called 
my attention to the fact that a problem being 
Worked on by John Keats at Princeton Uni- 
versity provides a mathematical model for 
the method of “contrived items.” Keats as- 
sumes that the probability of success on an 
item increases with ability at only one point 
on the scale of ability, rather than describing 
an ogive, It is not assumed that the probabil- 
ity of success on the item is zero below that 
Point nor unity above it, only that it is con- 
stant everywhere but at the single point. 
One has difficulty thinking of test content for 
Which this assumption is as reasonable as the 
assumption of a graded increase in the prob- 
ability of success on items. 


approach will be selected arbitrarily 
to show that this concept differs 
from concepts like reliability and scal- 
ability. 

Loevinger, , and DuBois (13) 


ferences in the trait which can be 
discriminated. Discriminating prob- 
ability refers to the proportion of 
discriminations which are in the 
same direction as trait differences. 
Discriminating range refers to the 
general level of the trait at which dis- 
criminations are made, No single co- 
efficient measures the three aspects 
of discriminating power; however, 
some coefficients are closely related 
to discriminating fineness, some to 
discriminating probability, and some 
to both. If the test construction 
process is conceived as beginning 
with the best few items in a finite 
pool and adding items one at a time 
from the pool in the order of the 
goodness of the items, then coeffi- 
cients measuring only fineness con- 
stantly increase, those measuring 
only probability constantly decrease, 
while those measuring both may in- 
crease at first and then decrease. 
The latter type of coefficient pro- 
vides a basis for deciding when to 
stop adding items to the test. 

This definition of discriminating 
power may prove unappealing to 
many psychometricians, since no 
single coefficient corresponds to it 
and since it is essentially an intuitive 
rather than a quantitative concept. 
Yet other quantitative disciplines 
begin frankly with intuitive concepts. 
As Ruth Tolman (18) has recently 
observed, among physical scientists 
it is the highest compliment to speak 
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of a fellow scientist as having “‘physi- 
cal intuition," but among those psy- 
chologists who most desire to emulate 
the physical scientists, intuition is 
rarely referred to in a similar compli- 
mentary sense. 

In summary, paradoxical property 
of a test is a property such that valid- 
ity is not a monotonic function of that 


property. Reliability is a property 
paradoxical in a region lightly popu- 


la 


ted with tests. Scalability is a 


property paradoxical in the region 
heavily populated with tests. Other 
possible properties, such as the dis- 
criminating power of the test, have 
not been fully investigated. 
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REGRESSION ANALYSIS: PREDICTION FROM 
CLASSIFIED VARIABLES! 


ROBERT M. GUION 
Bowling Green State University 


Occasionally in psychological re- 
search a qualitative independent var- 
iable seems promising in the predic- 
tion of a quantitative variable. Some- 
times, particularly in applied re- 
search, data for basically continuous 
independent variables must be ob- 
tained, if at all, in terms of broad, 
discrete categories, possibly of un- 
equal intervals. Such variables are 
at best inconvenient to work with 
in prediction problems; all too often, 
the difficulties they present are solved 
by ignoring them, resulting in the 
loss of a potentially useful predictor. 

In a study of supervisors (2), 
the writer was faced with this prob- 
lem. It was necessary to develop a 
regression equation by which a de- 
pendent variable (number of employ- 
ees supervised) could be predicted 
from such independent variables as 
man-hours of assistance, rough classi- 
fications of plant size, and the purely 
qualitative variable of industry clas- 
sification. The problem was solved by 
à technique of regression analysis. 

Regression analysis is a least- 
Squares regression technique per- 
mitting the prediction of a quantita- 
tive variable from categorized or 
qualitative variables. There seems 


! This article describes a procedure used in 

à thesis submitted to the Graduate School of 

Purdue University in partial fulfillment of the 

Tequirements of the degree of Doctor of 

ilosophy, carried out under the direction 

a Dr. C. H. Lawshe, and sponsored by the 
urdue Research Foundation. 

The theory and procedure presented here 
"nder the name of regression analysis were 
i eloped primarily by Professor I. W. Burr 
of the Statistical Laboratory of Purdue Uni- 


to be no particular limit to the num- 
ber of variables that can be handled 
by this method, as there is in the 
usual multiple regression techniques. 
Apparently the absence of products 
in the resulting regression equation 
reduces the likelihood of cumulating 
to a serious degree the errors of meas- 
urement. It is, of course, possible 
that spurious correlation might re- 
sult from including too many vari- 
ables. 

It is beyond the intent of this re- 
port to give a complete account of 
the theory and practice of regression 
analysis. The procedure, which has 
been used in agricultural research, is 
capable of further refinement and 
needs evaluative research. A de- 
scription of the basic theory and of 
the method’s application in agricul- 
ture is provided by Anderson and 
Bancroft (1). This report seeks mere- 
ly to outline the procedure as modi- 
fied in the writer’s use of it. 

The technique will be discussed and 

outlined by example, using a three- 
variable situation that might occur 
in personnel research. For purposes 
of illustration, we will be concerned 
with predicting the average number 
of working days attendance per 
quarter year of machine operators. 
The prediction will be based upon a 
knowledge of the applicant's cate- 
gorical standing in each of three 
variables: 
Ee Se re 
versity. It was subsequently learned that 
similar procedures based upon the same 
mathematical theory have been applied to 
biometric data. 
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Variable A: Preference for mechanical 
work 


1. First or second choice 
2. Third choice or not indicated 


Variable B: Housing status 


1. Own home 
2. Rent home 
3. Rooming 


Variable C: Presently employed? 


1. Yes 
2. No 


There are seven categories in all 
in this example; any applicant may 
be classified as being in three of them. 
Regression analysis seeks to estimate 
eight parameters: a, os for the two 
categories of variable A; 61, 82, 83 for 
the three categories of variable B; 
‘1, Y2 for the two categories of vari- 
able C; and yu for the mean attendance 
(dependent variable) of the general 
population of applicants. The sum 
of the parameters for the categories 
describing any individual, when 


added to the population mean, yields _ 


the dependent measure for that indi- 
vidual within limits of error, such 
that the regression model is 


x = ai + B; + m +n + error. 


From a sample available for study, 
we can estimate these parameters 
(estimates indicated by italic letters) 
so that 


x= a; + bit atm. 


In other words, an adjustment fac- 
tor is assigned to each category of 
each variable. For any individual 
applicant in the illustrative situa- 
tion, the predicted attendance record 
would be equal, within limits of error, 
to the mean attendance record of the 
total sample plus the algebraic sum 
of the adjustment factors of the cate- 
gories that describe him. 
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A least-squares solution of th 
parameter estimates is sought | 
using a system of eight simult: 
equations (the number of 
plus one) by the following proc 
1. Prepare the matrix of the sy 
of equations. The equation 
first category of the first variab 


XX., is the sum of the measu 
the dependent variable (i.e., a 
ance in days) for all cases ap 
in the first category of : 


application blank a first or 
choice for mechanical work. 


pearing in this category is designat 
"4. The notation Maslo, indicates 

number of cases in this cate; 
which are also classified in the first 
category of variable B: those 
own their homes. The next six. e 


variable A, the first categor 
variable B, and so on. 

The final equation is for the 
mean and is 


N 
Xx. 
i=l 


= nah — nah — inbi — 


Transposing all paramete 
mates and their coefficients 
matrix equal to the vector of th 
pendent variable. This 8 by 8 m 
of independent variables consis 


the unknown parameter esti 
The complete system of equa 
matrix form blocked off for ilh 
tive purposes, is shown as T: 
It will be seen that the to ang 


quency in any given category appears 
in the diagonal, and that the coeffi- 
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cients of the diagonal are equal to the 
eoefficients of the last equation for 
the general mean. 

The sample used in this illustra- 


tion co. ists of 49 machine operators, 
26 of whom have indicated machine 
work cither as their first choice or 
second (first category, variable A). 
Of these 26 men, 7 own their homes, 6 
are renting, and 13 are rooming or 


living with their parents (the three 
categories of variable B, respective- 
ly). They totaled 1,417 working 
days attendance during a three- 
month period. 

Blocking the matrix off as shown 
in Table 1 provides a convenient 
check on the accuracy of the tabula- 
tions made: the sum of the frequen- 
cies in any given block must be equal 
to the total number of cases in the 
sample, unless there are individuals 
included for whom complete data 
are not available. In any column 
Within a block, the sum of the co- 
eficients is equal to the coefficient 
in that column in the equation for 
the general mean. 

2. Reduce the matrix. This system 
of equations, as it now stands, is 
additive since the sum of the coeffi- 
Gents of the equations for each vari- 
able yields the appropriate coeffi- 
Gents for the equation for the general 
Mean. This set of equations does 
Not have full rank and therefore has 
no unique solution. 

À solution can be obtained by 
making the restriction that the sum 
of the estimated parameters, or a 
Weighted sum, be equal to zero for 
any given variable. With this re- 
striction, the process of reparametri- 
zation, discussed by Kempthorne (3), 
‘an be applied. Reparametrization 
Seeks to replace certain parameters 
and solve for a different set. It can 


AH 
ut 
AM 
ii 
dl 
E 
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Ẹ 
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category of each variable (coefficien 
in Table 1, italicized), we have the 
following reduced 5 by 5 matrix of 


equations to solve instead of the origi- 
nal 8 by 8: 
206 7 671] 98 1417 
7113 0 10:13 743 
6 0 15 6 15|*| 842}, 
11 10 6 23 23 1260 
26 13 15 23 49 2700 


This reduced matrix can be solved.* 
The solutions for the illustrative 
problem are shown in Table2.  . 

3. Solve the equations. There are, 
of course, many methods of solving 
sets of simultaneous equations. How- 
ever, in systems as large as will be 
found in most cases of regression 
analysis, loss of significant figures in 
the course of computation will be- 
come a serious problem if direct 
reduction or a determinant 
is used. In the supervisory study 
cited, for a matrix of 27 equations, 
the Gauss-Seidel iterative method 
was used. The 134 iterations required 
to reach a solution were performed on 
the IBM Card-Programmed Calcula- 
tor using a procedure described by 


3 The simplification of reparametrization is 
largely y gps of work done by L. E. 
Grosh, T. E. Cheatham, and Professor iain 


jet due University, for his labor in solving 
the problem used here for purposes of illustra- 
tion. 
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TABLE 1 


Matrix OF COEFFICIENTS OF THE UNKNOWN PARAMETER ESTIMATES AND 
Vector OF DEPENDENT VARIABLE 
Ümm EELELCEIGLELELCE!LLLIHLLOeL 


Coefficients of Parameters 


: Vector 
Categories ——————————— 
bi b b: a C m (2X) 
Variable A 
Category 1 7 6 13 11 15 26 1417 
Category 2 6 9 8 12 11 23 1283 
Variable B 
Category 1 13 0 0 10 3 13 743 
Category 2 0 15 0 6 9 15 842 
Category 3 0 0 21 7 14 21 1115 
Variable C 
Category 1 10 6 7 23 0 23 1260 
Category 2 3 9 14 0 26 26 1440 
General Mean 13 15 — 21 23 26 49 2700 


Note: Italicized values are those eliminated in reparametrization, 


4. Convert the obtained values. The set equal to zero, if the population 
solution after reparametrization can be assumed to be equally dis- 
yields for the illustration the values tributed among the various categories 
ay’, by’, bo’, cx’, and m’, since the last of the variable. 
category of each variable had been 2. Thesum of the estimated param- — 
set equal to zero. Conversion of eters of each variable, weighted ac- 
these estimates to estimates of the cording to the observed frequencies 
original set of parameters is ac- of the categories, can be set equal to 
complished by simple algebra, the zero, if it is assumed that the sample 
actual process depending upon the is random. 


restriction first imposed. One of 3. The sum of the estimated param- 
three restrictions might have been eters of each variable, given à | 
made: priori weights on the basis of known 


1. Thesum of theestimated param- or hypothesized population frequen- 
eters (or adjustment factors) can be cies, can be set equal to zero. 

The second restriction seems more 

TABLE 2 common and will be used in the exam 

SoLuTioNs TO ILLUSTRATIVE Equations ple. Following Kempthorne (3), it 


can be shown that a: +ar=0i It | 
Parameter Prime Parameter — is therefore necessary to find the kth 
voli Estimate value for each variable, i.e., the vum | 
a —1.048 —0.492 eliminated in reparametrization. The 
ai +0.556 "mv. ial 
restri that 9,0; 0, Of 
by 44.918  +2.689 n raiti =, 
x peg ee. the example where variable B has 
“a —1.865 —0.990 three categories, Ym.-0. Sub- 
a 30.875 T rs 
m +54.316  +55.102 stituting, this becomes in full 


* Solution to reparametrized matrix. ny (bi! +53) + as (s +b) ny 
4 
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Simplifying and solving for bs, 
mbi! + mb! 
N 
or, in the illustration, 
$ (13) (4.918) + (15)(3.020) 
49 


ù= — 


b = 


= — 2.229, 
and, 
b, = bi’ 4-0, = 4.918 — 2.229 = 2.689 
b= by! 4-03 = 3.020 — 2.229 — 0.791. 


The same general procedure is used 
to obtain the correct parameter esti- 
mates, or adjustment factors, for 
variables A and C, shown in Table 2. 
For the general mean, it is best to 
use the simple arithmetic mean as 
normally computed because of the 
rounding errors which creep into the 
corrected value after the solution 
through reparametrization. In this 
problem, the mean is 55.102. 

With these values known, the linear 
equation 


x=a+b;+a+m 


can be used for each applicant. In 
the illustration, we can now, by 
knowing the applicant's classification 
in the three variables and by knowing 
the mean attendance of the sample, 
Predict the applicant's job perform- 
ànce in terms of expected attendance 
per quarter. 

For example, an applicant who lists 
mechanical work as his first job 
choice (category a —0.492), who 
owns his home (category bi, +2.689), 
ànd who is presently employed (cate- 
gory c, —0.990) would be expected 
to have a quarterly attendance rec- 
ord, in integral units, of 56 working 

ays, the prediction formula being 


* = 55.102 — 0.492 + 2.689 — 0.990 
= 56.309. 
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The relative influence of each inde- 
pendent variable can be gauged ac- 
cording to the variance of the param- 
eter estimates, such that, in this 
problem, variable B carries a heavier 
weight in prediction than variable C, 
which is in turn weighted more heav- 
ily than variable A. 

It should be pointed out that this 
problem has been illustrative only; 
where the units are discrete rather 
than continuous, which is true of 
both the frequencies of individuals 
in various categories and of the meas- 
urements of attendance, there is 
probably very little justification in 
attempting to achieve greater than 
integral accuracy. 


CONCLUDING COMMENTS 


Regression analysis is a least- 
squares multiple regression technique 
permitting the prediction of quanti- 
tative variables from qualitative or 
classified variables. As outlined here, 
regression analysis assumes no signif- 
icant interactions between variables. 
If interaction is found to exist, ad- 
ditional equations can be introduced, 
treating combinations of interacting 
variables as separate classifications. 
It should be pointed out, for the sake 
of economy, that any error made by 
a false assumption of no interaction 
is an error of underestimating rather 
than overestimating the relation- 
ship; the effect is one of ignoring addi- 
tional variables which could have in- 
creased the predictive efficiency. 

The regression equation derived 
írom this method does not directly 
provide certain interpretational data. 
However, the predicted values of the 
dependent measure can be correlated 
by conventional procedures with the 
actual values. These correlation co- 
efficients can then be used to derive 
coefficients of determination and 
standard errors of estimate. 

The potential uses of the method 


tory analysis and to profile 
Nevertheless, it should be made 
subject of empirical investigati 
Several research questions need an- ables used. 
swering. Its predictive effectiveness 
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ESTIMATING THE SCALABILITY OF A SERIES OF 
ITEMS—AN APPLICATION OF 
INFORMATION THEORY 


RICHARD WILLIS 
University of Minnesota 


The powerful scaling technique 
developed by Guttman and others 
(3, 4, 5, 10) has several advantages, 
some of which seem to be unique. 
The merits and limitations of the 
method have been evaluated else- 
where (1, 2, 6), and they will not be 
reviewed here. The present paper 
will be restricted to a consideration of 
the problem of estimating the degree 
of scalability present in the area in 
question; i.e., the degree to which 
items in the scale are measuring the 
same thing in the population mem- 
bers. 

Guttman discusses several factors 
which should be considered in this 
connection, namely, (a) the number 
of items, (b) the number of respond- 
ents, or size of the population sam- 
ple, (c) the number of errors, or re- 
sponses which do not fit the pattern 
of a perfect scale, (d) the distribution 
of these nonfitting responses, (e) the 
number of response categories of 
the items, and (f) the distribution of 
the item marginals. The first three 
of these criteria are used to compute 
a coefficient of reproducibility r, 
Which is equal to 1— pg, where pz is 
the proportion of errors among the 


‘total number of responses. This 


Coefficient should equal or exceed 
‘90. The fourth factor is determined 
Y an inspection of the response 
Pattern. Nonfitting responses should 
be well scattered, indicating a ran- 
dom distribution, rather than appear 
in clusters, which would indicate a 
Systematic distortion of the scale 
Pattern. The last two criteria are ac- 


counted for by rule-of-thumb pro- 
cedures. The more response cate- 
gories given to the items individually, 
and hence the larger total number of 
response categories, the less likely is 
a scale pattern to appear by chance 
alone. It is recommended that the 
number of items be ten or more if 
they are all dichotomous, so that the 
number of response categories is at 
least twenty. Likewise it is recom- 
mended that there be included no 
more than a few items that have 
almost all responses lumped under a 
single alternative, as such items in- 
evitably boost the coefficient of re- 
producibility. For example, an item 
to which 95 per cent of the respond- 
ents answer "agree" could not possi- 
bly result in more than 5 per cent 
nonfitting responses, while an item 
with a 50-50 split between two cate- 
gories could theoretically result in 50 
per cent errors. Items with more 

two alternatives might produce even 
at best more than 50 per cent non- 
fitting responses if not removed from 
the series. 

Because of the effects which the 
number of categories and the distri- 
bution of category frequencies have 
on reproducibility coefficients, they 
are not strictly comparable as gener- 
ally computed. A means of quantita- 
tively accounting for these effects 
would be especially useful because 
it is standard practice to administer 
each item with several categories, 
and then to combine adjacent cate- 
gories in an attempt to get r up to 
.90. And, of course, the category 
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frequencies are changed at the same 
time. Another common practice in 
which these effects play an important 
part is that of selecting a subgroup of 
items from among those adminis- 
tered. In both cases the manipula- 
tion is selective, favoring the result 
sought by the experimenter, and 
should be corrected for. 
The concept of entropy! has been 
applied to the theory of information 
by Wiener (11) and by Shannon (9). 
This interesting concept provides the 
means for such a correction. The en- 
tropy H associated with a contingent 
event is an indication of the number 
of possible outcomes it can have. It 
is consequently a measure of the un- 
certainty that the event will occur 
in any specified one of these possible 
ways. The toss of a die, with six 
equally likely possible outcomes, in- 
volves more entropy than the toss of 
a coin, with only two equally likely 
outcomes possible. Knowing the 
outcome of a high-entropy situation 
gives us more information, in the 
sense that more uncertainty is re- 
moved, than knowing the outcome of 
a low-entropy situation. 

How does this apply to estimating 
the degree of scalability present in an 
area? If we make our estimation 
first under one set of conditions and 
then make a second estimation under 
a different set of conditions having 
a different associated entropy value, 
the two estimates are not comparable 
unless we can allow for this difference. 
This is true whether we have tested 
in the same area twice or in two com- 
pletely different areas. The situation 
is somewhat analogous to comparing 


1 It has been so named because of its mathe- 
matical and conceptual similarity to the term 
entropy used in statistical mechanics, which in 
turn was named after the entropy of classical 
thermodynamics. For another application of 
entropy-like considerations in psychology, see 
Miller and Frick (7). 
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two chi-square values without c 
sidering a difference in the . 
freedom. Let us imagine, for exam- 
ple, that we have given ten trichot- 
omous items to 100 respondents. 
Then we arrange the responses so 
that they form as nearly as possible a 
scale pattern, using any of the avail- 
able methods (3, 5, 10). Say the co- 
efficient of reproducibility r turns 
out to be .80. This does not meet the 
accepted standard of .90, and 
begin to judiciously combine adjacent 
categories so that we are left with ter 
dichotomous items and an r of 
Is this really any stronger evi 
that the area is satisfactorily sca 
The proportion of responses 
do not fit the scale ete has been 
cut in half, but perhaps the restruc- 
turing of the situation has reduced | 
the likelihood of such nonfitting re- 
sponses by as much or more. 4 ie 
more we manipulate the situation to 
our advantage, the less chance there 
will be for errors to show themselves. 
Our judicious manipulation reduces 
the entropy and imposes greater re 
strictions on the responses, forcing 
them to more nearly fit the scale 
tern. 
To see how much this chance 
errors to appear has been à 
let us calculate entropy values tof 
conditions before and after combining 
categories. The formula is 


H-2-— Y. pi loge pis 
i=l 


in which 5; is the relative probab 
of the event i occurring? As ĝi 
always be less than 1, if there 15 
uncertainty whatever about the 0 
come, logsp; will be negative and 
2 Shannon (9) discusses the characti 
which a good indicator of uncertainty. 
have and shows that this is the only fe 
expression which does have these ch 
istics. 
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will be positive. The logarithm is 
taken to a base of 2 only because H 
then takes on a convenient numeri- 
cal value—it then indicates how 
many times the number of equally 
likely possibilities must be halved 
to remove all uncertainty about the 
outcome. An H value of 1.00 would 
represent an event such as the toss 
of a coin, in which the equally likely 
possibilities are reduced from 2 to 1 
when the outcome becomes known. 
This same H value would also char- 
acterize a single respondent checking 
a dichotomous item, assuming we 
have no a priori knowledge about 
which category he is more likely to 
endorse, i.e., assuming 55 57.5. In 
either the case of the coin or the single 
respondent answering a single di- 
chotomous item, formula 1 reduces to 


H = — 5 loge .5 — .5 log: 5 
= — 5(— 1)— .5(- 1) = 1.00. 


But in our imaginary example above, 
we had 100 respondents instead of 
only one, and so we need not assume 
that all the presented choices for 
an item are equally likely to be chos- 
en. We obtain an estimate of the 
relative probabilities from the re- 
sponse frequencies, and the larger 
the group, the better will be our esti- 
mates. The response frequencies give 
us a set of p;'s for each item such that 
2p:=1.00; using these p;s, we can 
calculate an H value for each item 
in the series. Then we sum item H's, 
and the result is an H value char- 
acteristic of our series of items. This 
Value indicates how much informa- 
tion is obtained (i.e., how much un- 
certainty is removed) each time we 
administer our item series to a re- 
Spondent. The formula may be 
Written 


Z8 
H--» È pu logs Pin Q 
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where py is the relative probability 
of category 4 on item j being en- 
dorsed, I is the number of items in 
the series, and c, is the number of 
response categories presented with 
item j. The summation is first over 
categories within items and then over 
items. Note that this formula does 
mot take into consideration the pat- 
terning of the responses. It is the 
coefficient of reproducibility which 
does this, and that is why the two 
measures must be considered together 
in order to see the whole picture. 

Returning once more to our exam- 
ple, suppose we compute the entropy 
before, Hı, and after, Hs, combining 
categories with the result that M/H, 
=2. Then the nonfitting responses 
have been reduced only in proportion 
to the reduction in entropy. We 
should conclude that the increase in 
r is most likely due to sheer manipu- 
lation. It should be pointed out here 
that increased reproducibility after 
combining categories can mean more 
than this. Such would be the case if 
respondents were differentiating be- 
tween the first category on an item 
and the other two according to their 
true scale position, but were distin- 
guishing between the second and 
third categories only by chance, or 
by habits of expression, or by any 
other extraneous factor. But before 
we could conclude that this sort of 
thing was happening, it seems reason- 
able that we should require that the 
errors, or nonfitting responses, be 
reduced as much as or more than the 
entropy, so that 


ps,/pe, = Hi/ Hs (3 


where pz, and pz, are the relative 
proportion of errors before and after 
combining categories, respectively. 
Lowering H means a greater structur- 
ing of the situation, and.as we have 
been deliberately structuring it in 
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our favor, we must demand not only 
a reduction in errors, but a reduction 
at least in proportion to this increased 
structuring. 

The proportion in formula 3 may 
also be written as 


Hs/ ps, & Hs/ Pes 


and this suggests the possibility of 
setting up a standard Z7/pg ratio 
with which to compare H to pg values 
obtained in practice. Taking into 
consideration the values of both r 
and H/ps would decrease the possi- 
bility of accepting as scalable an 
area which is not. It might also oc- 
casionally provide the evidence 
needed to accept as scalable an area 
about which we would have other- 
wise been in doubt. Choosing a 
standard ratio, call it H,/pzg, is 
largely a matter of deciding how con- 
servative we wish to be. It seems 
logical to choose pz,=.1, as this 
corresponds to an r of .90. We have 
a rough guide in choosing H, in Gutt- 
man's statement (10, p. 79) that at 
least ten items should be used if 
they are all dichotomies. If we as- 
sume that the 5;'s are normally dis- 
tributed, so that each p; is associ- 
ated with an equal portion of the 
area under the probability curve, 
H,/pz, then becomes 8.9/0.1=89. If 
we accept this standard ratio, then 
we would require an r of more than 
.90 should the entropy of the item 
series fall below 8.9. If, for example, 
our item series had an H value of 17.8 
(which would be extremely high), 
an r of .80 would most likely be as 
strong an indication of scalability as 
the r of .90 in the first example, al- 
though a conservative experimenter 
would probably prefer to exceed the 
minimum values for both r and H/pz. 

A previously mentioned situation 
which frequently occurs in practice, 
for which the above considerations 
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` helpful in interpreting these r's. - 


are also in order, is that in 
subset of items which seems | 
a scale is chosen out of | 
number of items ad c 
Choosing such a subset impos 

greater structuring on the sitü 
and produces a corresponding H dre 
As in the case of combining cat 
ies, H/ pg as well as r should be no 
before drawing conclusions. Whet! 
or not an experimenter choose 
use a standard H to pg ratio, tha 
gested above or any other, it 
seem advisable in any case to 
any manipulation, such as comb 
categories or picking out sub 

items, and the resulting M di 
along with the coefficients of 
ducibility, as this information 


When categories are combine 
subsets of items are picked o! 
cording to the usual practice, 
are actually two distinct proe 
going on simultaneously. First, 
is the increased structuring amd t 
concomitant decrease in entrop 
Second, there is the process of $ 
tion carried on by the experiment 
which he capitalizes on the 
tunities which are presented by 
data. The first process is an 
matic result of decreasing the nun 
of items or response categories ¢ 
would obtain no matter whether | 
combining of categories or selecta 
of item subsets is done selectively € 
entirely at random. It is the 
process, the automatic H drop, 
is adjusted for by the H/px ratio, 1 
there remains the problem of acci 
ing for the process of selection. 

It would seem that the only ! 
of scalability which allows f 
second factor is that of cross 
tion on a new sample. And ob 
the greater the H drop, the 
the possible influence of selecti 
therefore the more esssential It © 


i 


tomes to cross validate, Not only 
should the new coefficient of repro- 
ducibility and the new H drop, taken 
together, meet the minimum values, 
but it is also important that the items 
arrange themselves in the same order 
of difficulty or very nearly so. It is 
also desirable that the most ad- 
vantageous way of combining re- 
sponse categories be similar for both 
samples. 

There is another situation which 
does not occur often in practice, 
but which nevertheless is interesting 
from a theoretical standpoint. As- 
sume that we wish to interpret a co- 
efficient of reproducibility based on 
some other sample size than the 
usual 100. For some reason only 80 
respondents are available. Can we 
assume that 80 respondents will sup- 
ply us with 80 per cent as much in- 
formation as 100 respondents in the 
sense that the associated H values for 
the two testing situations would be in 
the ratio of 80 to 100? This assump- 
tion would not be justified. The proc- 
esses of combining categories and 
selecting a subset of items were both 
Selective processes, and thus a defi- 
nite structuring of tlie situation took 
place favoring a reduction of the pro- 
portion of nonfitting responses. But 
there is no such selection in the case 
above, and there is no reason to sup- 
pose that the proportion of errors will 
be changed.) Since the degree of 
Structuring is not changed, there will 


* As R. L, Thorndike has pointed out to the 
author, in extreme reduction of the sample size 
there will be an appreciable reduction in the 
Proportion of errors because of the fact that 
the responses among which we are looking for 
disagreement are also those which determine 
the scale values of the respondents. This 
. TPurious consistency is probably negligible, 
however, for values of N which are large in 
Comparison to the number of possible scale 
Values, which is equal to the number of items 
P'us one, 
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Sis 
be nochange in the associated entropy 
value. 


Thus it seems reasonable to as- 
sume that the mean of the distribu- 
tion function of r is not appreciably 
affected by moderate changes in N, 
although the standard error of r ob- 
viously will be. When comparing 
two r's based on different sample 
sizes, we need not make any allow- 
ance for different entropy values; we 
need only keep in mind that the 
smaller the N, the larger the stand- 
ard error of r. As present theory does 
not enable us to write the expression 
for this standard error, we can be 
safe only by avoiding sample sizes 
below the accepted standard ef 100, 

Table 1 gives the entropy values 
associated with dichotomous items 
for various differences between the 
response frequency proportions. 


TABLE 1 


EwrROPY VALUES ASSOCIATED with DiCHOT- 
omous ITEMS ror VARIOUS DE- 
GREES OF RESPONSE Bias 


.00 
10 
.20 
40 
“50 8113 
.60 
T0 
.80 
:90 
.00 


ore extensive tables of values for 
Moi oe p have been published (8), 
or if necessary, a table of common 
logarithms may be used to compute 
logsp values by use of the following 
relationship: 


logs p=logio p/logi 2=3.3219-logio $. 
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L. & Kirarzick, F. 


Scale analysis 


1. Epwarps, A. 
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NOTE ON SCORE TRANSFORMATION AND 
NONPARAMETRIC STATISTICS 


WALTER C. STANLEY 
Brown University 


It is becoming increasingly popu- 
lar to assess the statistical significance 
of markedly skewed data (e.g. la- 
tency of running or bar-pressing 
responses in rats) by means of non- 
parametric statistics. This is not sur- 
prising in view of the marked ease of 
calculation. However, one source of 


The smaller of the two sums of ranks 


TABLE 1 


ILLUSTRATIVE DATA ANALYZED IN Two Ways wrru WiLcoxox's Pame 
REPLICATES Test 


Subject | 
| Cond. A Cond. B 
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labor saving—working directly with 
raw data rather than with trans- 
formed scores—may sometimes be a 
mixed blessing. This could be the 
case when working with ranks of 
differences between paired scores. 
Here some score transformations 
would alter the magnitudes of the 
differences and thus their ranks. 
Consider Wilcoxon's (1) test for 
paired replicates applied to the raw 
scores in Table 1. These scores could 
be latencies in seconds of a running 
response in rats obtained under two 
conditions of extinction. The largest 
difference is in the negative direction. 
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assigned a negative sign, but here this 
rank is 1, and the p obtained is be- 
tween .02 and .01. 

Clearly some consideration must 
be given to the meaningfulness of 
scale units, and more generally, to 
the population of values to which 
one wishes to generalize in order to 
take full advantage of this "rapid 
approximate" statistical technique. 


REFERENCE 
1, Witcoxon, F. Some rapid approximate 
statistical procedures. Stamford, Conn.: 
American Cyanamid Co., 1949. 
Received December 26, 1953. 


517 


BOOK REVIEWS 


Oscoop, CnmamLes E. Method and 
theory in experimental psychology. 
New York: Oxford Univer. Press, 
1953. Pp. vii+800. $10.00 ($14.00 
trade edition). 


According to the preface, this book 
was written "to provide undergradu- 
ate majors and graduate students in 
psychology with a text that evaluates 
experimental literature in close rela- 
tion to critical theoretical issues." 
For such use it has several virtues. 
Not only are the theoretical issues 
about which the book is centered im- 
portant in their own right, but they 
also provide schemata that should 
help the student to comprehend and 
retain the myriad facts of experi- 
mental research. Osgood’s descrip- 
tion of many experiments is so mi- 
nute as to be the next best thing to 
the originals themselves. Further- 
more, within its range, the book re- 
flects with marvelous faithfulness the 
problems and atmosphere of experi- 
mental psychology as this is repre- 
sented in the latest volumes of Ameri- 
can psychological journals. 

This timeliness is apparently not 
due to unusual recency in the cita- 
tions. A sample of 100 was drawn 
from the 1,290 references in the 
bibliography; when these were dis- 
tributed by publication date, their 
median date was found to be 1937. A 
similar sample of 100 from the ap- 
proximately 1,800 references in 
Woodworth’s Experimental Psychol- 
ogy of 1938 (with which Osgood’s 
book is bound to be compared) 
yielded a median date of 1922. Thus, 
the median ages of the citations at 
the time of the publication of the 
two books were both 16 years. (Is 
there a law here?) | Woodworth's 
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references are, however, more si 
toward early dates than Osg 

On the other hand, the 
both massive and difficult. 
three pages on vacuum tube 
fiers and cathode ray oscillographs 
will be insufficient for the student who 
is untrained in physics and 
needed for one so trained. Th 
a bit of mathematics here and 
but probably not enough to fri 
the bright and well-motivated st 
dents for whom the book must 
been written. The first 300 pages} 
suppose rather extensive acqua 
ance with neural anatomy and! 
rophysiology; a few anatom 
drawings, including maps of € 
cytoarchitecture, would have 
helpful here. 

Then, too, for all of its 727 pi 
of text, the book is limited in 
The author says that he has covet 
*...the major part of what 
called experimental psychology, 
cluding sections on sensory pro 
perception, learning, and symbol 
processes." In 1938, Woodw 
had chapters on these topics 
three on feeling and emotion and 
each on “experimental estheti 
GSR, reaction time, attention, 
reading. The lack of material : 
motivation and action, which 
treated at least in part by such 
recent book as Stevens’ Handbook, 8 
well as other topics that other reader 
will no doubt miss, will be felt ket 
in any broadly conceived prosemin 
in which Oszood's book is used. It 
also likely to raise the tiresome qu! 
tion, what is experimental psyche 
ogy? E 

The word “method” appears in t! 
title but not as a heading in the e 
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ject index, and it can be said that the 
text deals more with method in 
particular than with method in gener- 
al. One extensive evaluation and 
comparison of methods as such does 
appear, namely, that on psychophys- 
ics, but this is not wholly satisfactory, 
The method of average error is de- 
scribed inadequately and, the review- 
er believes, incorrectly. The “flicker- 
fusion method” is given treatment 
coordinate with that of the standard 
methods even though it must itself 
employ one of these latter methods. 
Further, it appears that the author is 
more worried than he need be over the 
interpretation of the results of the 
two-category method of constant 
stimuli. 

To regard this work merely as a 
textbook or handbook would, how- 
ever, overlook its most significant as- 
pirations, which have to do with 
the evaluation of theory and the 
maintenance, in the author's words, of 
“a certain continuity of approach.” 
Now, it is notorious that experiment- 
al psychology does not easily submit 
to systematic unification, and we 
regretfully judge that this proposi- 
tion is not threatened by the present 
book. The theoretical atmosphere of 
the first six chapters, on sensory and 
Perceptual processes, is noticeably 
different from that of the remaining 
ten, on perceptual dynamics, learn- 
ing, thinking, and language. Osgood's 

ulian views have no relevance to 
Such matters as the cortical basis for 
Sensory quality and intensity, the 
quantal hypothesis in audition, and 
the significance of visual adaptation. 

hat little integration is achieved 
for Sensory psychophysics and psy- 
chophysiology comes from the au- 

or's interest in the mode of action 
of the cerebral cortex, where he 
Prefers neurostatistical conceptions 
to the "dynamics" of Köhler. Hel- 
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son's conception of adaptation level, 
which might have been used to link 
together some problems that remain 
unconnected, is not mentioned. 

Beginning, however, with the chap- 
ter on central factors in perception 
and continuing through most of the 
remainder of the book, consideration 
is given to major theorists. Tolman, 
Guthrie, the gestalt psychologists, 
and Hull are discussed extensively, 
and briefer accounts are given of 
Hebb, Skinner, and others. It is ob- 
vious that, in most instances, Osgood 
has made intense efforts to under- 
stand and to present sympathetically 
opinions with which he partly or 
wholly disagrees, as well as to exhibit 
in full relief the weaknesses of those 
by which he takes his own stand. 
How successful he has been in the 
exposition of each of these views will 
certainly be judged differently by 
readers of different theoretical pre- 
dispositions. The reviewer's belief 
isthat thesystemsof Hull (up through 
Principles of Behavior—the later 
books were too recent to be included), 
Guthrie, and Tolman are discussed in 
such a way as to give their followers 
little cause for complaint; indeed, 
Osgood has constructed challenging 
formalizations of the last two, with 
the aid of Voeks in Guthrie's case. 
Gestalt theory is more alien to him, 
and he makes the mistake of writing 
as if Köhler, Wertheimer, Koffka, 
Lewin, and J. F. Brown adhered to 
one system in common. 

Three pages are allotted to Hebb 
as an exponent of physiological the- 
ory. Unfortunately, the more purely 
psychological aspects of Hebb's ideas 
are lost sight of, so that, for exam- 
ple, his contributions to the the- 
ory of thinking are not referred to 
in Osgood’s chapters on this topic. 
Least satisfactory is the treatment. 
accorded to Skinner. His views on 
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the role of theory in psychology are 
omitted, and some of the remarkable 
regularities that he has established, 
which are clearly of concern to à 
learning theorist, are given little or no 
attention. This is true, for example, 
of the effects of different reinforce- 
ment schedules. Periodic reinforce- 
ment is mentioned in passing, with 
the comment that “The mechanism 
whereby an animal smoothly adjusts 
its rate of response under these condi- 
tions is unknown." “Intermittent 
reinforcement” is discussed more 
thoroughly, but Skinner's name is not 
mentioned in connection with it. The 
same is true of the problem of the 
number of kinds of learning. A 
rather misleading statement (p. 307) 
could cause the reader to suppose 
that Skinner accepts the concept of 
disinhibition. 

Osgood's own ‘mediation hypothe- 
sis," by which he hopes to effect a 
rapprochement between “reinforce- 
ment” and “cognitive” theories, is 
not a hypothesis in the ordinary 
sense, but rather a broad extension of 
Hull’s concepts of anticipatory goal 
responses and their associated pro- 
prioceptive stimuli. The following 
statement from the chapter on lan- 
guage behavior exemplifies one of the 
functions attributed to mediators, 
that of serving as signs: 

...a pattern of stimulation which is not 
the object is a sign of the object if it evokes 
in the organism a mediating reaction, this (a) 
being some fractional part of the total be- 
havior elicited by the object and (b) producing 
distinctive self-stimulation that mediates re- 
sponses which would not occur without the 
previous association of nonobject and object 
patterns of stimulation (p. 696). 

Such mediated self-stimulation has 
cue, motivating, and reinforcing 
properties. Mediators are not neces- 
sarily peripheral but may be purely 
cortical events; they are said to have 
the status of "hypothetical con- 
structs." The following comments 
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are made by the reviewer, with tl 
caveat that they should be taken a 
no more than fragments of a possible 
critical analysis of the hypothe 
(a) The mediators are so substitut 
able for the “ideas” of association: 
theory that occasionally the discuss 
sion sounds like a mere translation of 
such theory into Hullian language. It 
would be no defense to attack co 
tive theory on the same grounds. (| 
While the evidence submitted in 
vor of the hypothesis seems to indie” 
cate that some sort of mediation 
involved in the behaviors that are 
scribed, it does not, in the reviewer 
opinion, discriminate clearly in favo 
of fractional goal responses. (c) The 
explanations by mediators have am 
essentially chainlike character, 
are open to all of the objections 
have been raised against such a con 
ception since Dewey's paper on the 
reflex arc. (d) The attribution to me 
diators of both motivating and 
forcing properties leads to the sam 
extreme difficulties that leave € 
stimulus-reduction theory of rem 
forcement hanging by a thread on 
page 443. (e) One would like mi 
evidence that the hypothesis can ? 
dict new and unexpected phenomeni 
On questions of philosophy of sc 
ence and theory of knowledge th 
book is disconcertingly artless, am 
the thought is more energetic H 
subtle. Although Osgood calls him 
self a materialist, and says that “B 
haviorally, . . . the environment 
pattern of neural energies in the ce 
tral nervous system,” and that, WH 
one touches something, "The aw? 
ness of sensation is not... %. 
fingertips . . . but in the brain, 7 
nevertheless wrestles with the £108 
of introspectionism without M 
quishing them, since they remain 
haunt many pages of the book. Att 
same time, he shows no great 
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egn, almost anything that one might 
say about oneself is called “introspec- 
tion." For examples from other con- 
texts, Lloyd Morgan's canon is 
treated as if it were subject to experi- 
mental test; and the discussion of 
repression leads to the methodologi- 
cally awkward conclusion that moti- 
vated forgetting “ . . . is certainly a 
valid observation in the clinic and 
probably would be verified in the 
laboratory. ..." 

So ambitious a work can be ex- 
pected to contain some inaccurate 
and debatable matter. Thus, the all- 
or-none law is so stated as to make it 
depend upon the full utilization of 
"materials" in the neuron; in con- 
nection with the Talbot-Plateau law, 
it is asserted inaccurately that 
“with a light-dark ratio of one 
to one, the fused field will appear one- 
half as bright as the illuminated sec- 
tor’; the color solid, which some 
might regard as a minor triumph of 
taxonomy, is dismissed as a mere 
pedagogical device, apparently on the 
incorrect supposition that it was in- 
tended to represent the facts of color 
mixture; it is said that interference 
theories of extinction never specify 
the interfering responses, whereas a 
number of such responses were identi- 
fied two pages earlier in the descrip- 
tion of research by Wendt; a refer- 
ence to work by Birenbaum and Zei- 
garnik leaves the impression that 
Lewinian theory presumes that 
boundaries between regions within 
the person are less permeable in the 
child than in the adult, when in fact 
the theory asserts the reverse. 

The book is seriously marred by 
humerous errors of grammar, spelling, 
and typography, as well as other 

lemishes of which some are matters 
of taste. It is regrettable that most 
of these were not weeded out by the 
publisher's editing. A few miscel- 
aneous examples follow. 
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101. It may also be men 
type seems very small for the length 
on the large single-column page. 

The foregoing, however distress- 
ing, are minutiae, while Osgood's 
accomplishments are not. Scattered 
throughout the book are original 
theoretical and experimental contri- 
butions on topics ranging from color 
contrast to transfer and retroaction in 
learning. Several chapters are unusu- 
ally interesting, e.g. Chapter 7 on 
perception and Chapter 14 on prob- 
lem solving. Furthermore, the author 
is at his best in assembling the evi- 
dence for and against testable research 
hypotheses and doggedly following 
the trail of the experiments even when 
they lead in an undesired direction; 
and this is, after all, his principal 
intention. 


a 


Francis W. IRWIN. 
University of Pennsylvania. 


ROBERT, J., & AL- 
BRECHT, RUTH. Older people. New 
York: Longmans, Green, 1953. 
Pp. xvi+415. $5.00. 

This book is an admirable antidote 
to the prophets of doom and gloom 
who identify aging with senility, disa- 
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bility, and frustration. It is based on 
a detailed survey of a small midwest- 
ern city (7,000 in the town itself and 
4,000 in the adjacent trading area) 
and presents factual data about what 
the older people there are like (670 of 
them 65 or older). Individual inter- 
views were carried out with a sample 
of 100 of the old people drawn to 
represent the entire old age popula- 
tion by sex, socioeconomic status, and 
marital status. 

'The book, however, is more than a 
compilation of factual information. 
It adds a conceptual framework for 
the analysis of the needs and desires 
of individual aging people, an analy- 
sis of the community's reaction to the 
aged, and a formulation of basic 
principles essential to the effective 
planning of programs for the aged. 
Writing in an easy, flowing style, 
with enough ancecdotal material to 
maintain a lively interest, the authors 
manage to impart a great deal of fac- 
tual information without the reader's 
being aware of it. 

The book should be general in its 
appeal, not only to those who are 
especially concerned with the prob- 
lems of aging, but also to the aging 
people themselves. Thus, the physi- 
cian who may be exposed daily to 
the aches and pains of older people 
will be agreeably surprised to learn 
that 79 per cent of the oldsters in this 
cultural environment regarded them- 
selves as “healthy” and that only 6 
per cent were homebound and 2 per 
cent actually bedridden. The social 
worker, who must deal with the prob- 
lem of finances, living arrangements, 
etc. among the aged, ought to know 
that 43 per cent of this population 
reported they were happily situated 
and that only one-fourth were actu- 
ally unhappy. It is also important to 
know that happiness was not related 
to economic status and that health 
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became a major component of happi- 
ness only in the 20 per cent of the 
population who had specific com- 
plaints; i.e., old age was what the in- 
dividual made it, with economic and 
health aspects assuming a secondary 
role. All but 4 per cent of the popula- 
tion reported that they had enough 
money to get along on provided they 
had no unusual medical expenses. 
Anxiety about what they would do in 
case of protracted illness requiring 
hospitalization was present in a large 
proportion of the elderly people. 
There was no evidence that the com- 
munity rejected old people because of 
their age. However, the importance 
of the role of the individual in the 
community was emphasized. In gen- 
eral, the community seemed quite per- 
missive about what it expects of older 
people and looked with favor on con- 
tinued activity. 

The authors present a number of 
concepts that may require further 
analysis. For example, their outline 
of the meaning of work shows the 
need for a variety of retirement plans 
if we are to meet the needs both of 
the individual and of society. The 
concept of socioeconomic mobility 1$ 
applied to the psychological adjust- 
ments of aging people in an inter- 
esting manner. Whether all of the 
conceptual formulations will prove 
useful remains to be seen, but the au- 
thors are to be complimented on their 
willingness to theorize and thus re- 
duce a mass of specific observations to 
some intelligible formulation. ^ , 

Psychologists with a quantitative 
psychometric viewpoint will be dis- 
appointed by the use of nonsca 
questionnaire techniques, and sociolo- 
gists may be disturbed by the inclu- 
sion of a chapter on “A Personal an 
Social Philosophy of Old Age” whi 
makes specific recommendations 
about “rational defenses” that can 
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used by older people. One might also 
quibble about whether the popula- 
tion of Prairie City is representative 
of the entire United States. How- 
ever, the solid facts of how one group 
of aging people behaved in a definable 
environment have been admirably 
explored by this study. Only by simi- 
lar studies in different community 
environments can the generality of 
the findings be settled. 

It is good to know that in some 
community structures a large propor- 
tion of the older people can optimisti- 
cally meet the problems of aging. 
This is a book that should be read by 
everyone interested in the welfare of 
his own community. 

N. W. SHock. 

Baltimore City Hospitals. 


FEDERN, PAUL. Ego psychology and 
the psychoses. (E. Weiss, Ed.) New 
York: Basic Books, 1953. Pp. 375. 
$6.00. x 


The editor had the difficult task of 
organizing the presentation of a sys- 
tematic theory of ego functioning out 
of another man’s half-century (1901— 
1952) of prolific publishing (almost 
100 articles, lectures, etc.). The 40- 
year association as pupil and friend 
of Paul Federn provided important 
background for this task. In the 21- 
Page introduction, Weiss attempts to 
establish Paul Federn’s orthodox 
loyalty to Dr. Freud (to take care of 
the "minor" differences in the area of 
ego theory), to apologize for Federn's 
difficulties in exposition which is“ . . . 
Very rich in content but... often 
complex etc.” (p. 21), and to present 
à condensed, clarifying guide to the 
metapsychological wilderness of the 
text. He apparently did not feel that 
Federn could provide in 16 articles 
and 340 pages a sufficiently clear 
Presentation of his thoughts, but un- 
fortunately the guide itself needs a 
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guide. The first five “theoretical 
papers” are a maze of unclear verbal 
gymnastics, poorly defined concepts, 
and repeated self-adulation by 
Federn which does not sufficiently re- 
ward the reader for his difficult labor 
of seeking meaning. In this part of 
the book, Federn shares a difficulty 
common to many psychoanalytic au- 
thors who consider operationalism a 
primitive scientific procedure and 
prefer an artistic verbal medium. 

The second part of the book (nine 
papers) is a fuller measure of Dr. 
Federn's contribution. Here, his rich 
clinical experience and observational 
astuteness appear to good advantage. 
A conception of the ego differing in 
many respects from that postulated 
by Freud is presented. “Ego Feeling" 
seems to be viewed as a basic energy 
which integrates ego functioning and 
makes ego experience possible. An 
interesting hypothesis suggesting a 
somatic and mental differentiation of 
ego feeling is developed. There is a 
much-repeated emphasis on psychosis 
as an ego-defeat phenomenon, as 
against the neurotic dynamics of ego 
defense. The so-called “Steinach 
effect” is defended as a sufficient 
basis for the sterilization of latent 
psychotics and psychotics. However, 
the psychological rationale given in 
Federn’s papers hardly constitutes a 
convincing argument. 

This book is not the “Vade Medi- 
cum" for the treatment of psychosis 
which the publishers enthusiastically 
proclaim on the book-jacket blurb. It 
is certainly not recommended for 
general reading. However, if the 
reader is familiar with psychoanalytic 
terminology and has the perseverance 
to follow Federn through "thick and 
thick," then he may find in the papers 
many provocative speculations based 
on extensive clinical experience. 
There might even be the reward of 
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deriving fruitful hypotheses for ex- 
perimental investigation from some of 
these speculations. 

M. ErIK WRIGHT. 


University of Kansas. 


McCurpy, HaroLp Grier. The 
personality of Shakespeare. New 
Haven: Yale Univer. Press, 1953. 
Pp. xi+243. $5.00. 

Almost twenty years ago Caroline 
Spurgeon published the results of her 
extensive study of Shakespeare’s 
figures of speech from which she 
drew a number of inferences concern- 
ing Shakespeare's interests, preoccu- 
pations, attachments, personality 
traits, personality development, and 
the like. Her study was based upon 
the premise that a person reveals 
things about his personality by the 
kinds of metaphors he uses. 

McCurdy has tried to get at the 
personality of Shakespeare by a dif- 
ferent route. He has counted the 
number of lines that every character 
in every play speaks in order to de- 
termine the principal characters in 
each play. The assumption is made 
that the more important features of 
Shakespeare's personality are repre- 
sented in the principal characters. In 
addition to character analysis, Mc- 
Curdy has analyzed some of the ma- 
jor themes that run through a num- 
ber of plays much as one would 
analyze a TAT protocol. 

For anyone who admires Shake- 
speare and who feels that the psy- 
chologist has something to offer in the 
way of method for shedding more 
light upon the kind of person that 
Shakespeare must have been in order 
to write as he did, this is a fascinating 
study. Dr. McCurdy is dedicated to 
his subject and writes with sensi- 
tivity and critical devotion. 

Most psychologists do not quarrel 


1 Shakespeare's imagery and what it tells us. 
New York: Macmillan, 1935. 
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with the hypothesis that à 
projects himself into his in 
and that it should be possible ther 
fore to make a personality analysis 
a writer from his writings alone, Th 
same hypothesis probably holds 
painters, composers, architects, 
signers, and anyone who prod 
something out ci his imag 
However, Shakespeare is the worst. 
possible subject on whom to test the 
hypothesis. ^ Virtually noth 
known about the man. In fi 
little is known that a numb 
people believe that someone € se 
must have written the plays 
tributed to Shakespeare. Such be 
the case, how can one ever hope 
confirm or infirm the inferences t 
are made about the personali 
Shakespeare from his writi 
I wish that Dr. McCurdy had 
voted his considerable talents both 
a psychologist and as a literary 
to the examination of a writer à 
whom a great deal is known. 
ingway or Mickey Spillane 
make excellent choices. Is it true, tor 
instance, that Mike Hammer rep 
sents the shadow-side of Spilla 
personality? How much of the old 
man of the sea is to be found 
Hemingway? McCurdy's method 
of analysis would undoubtedly 
pay dirt and reveal a great deal 
the nature of projection if he 
them on more appropriate subj 
CaLviN S. HALE. 
Western Reserve University. i 


STOLUROW, LAWRENCE MEA 
Readings in learning. New 
Prentice-Hall, 1953. Pp. vill 
$6.00. 


Stolurow has collected 42 
graphical items from the f 
learning, either in whole or in | 
and by combining three into on 
and two into another, has p 
39 articles under one cover. 


pose was to provide a sample of origi- 
nal material to supplement secondary 
source | .aterials since "only by read- 
ing and analysing original reports can 
the student learn how to conduct a 
variety of different types of research 
and at the same time become aware of 
the problems and difficulties in- 
volved.” Presumably this book was 
intended to serve as a second text in 
a course in theories of learning or per- 
haps as a source book in a course in 
which no text was used, although 
Stolurow suggests that, "Where a 
laboratory is available, this volume 
could serve as a laboratory text. The 
experimental studies could be used as 
models for both experiments and 
report writing, and the theoretical 
articles as bases for new studies.” 

It is a fairly safe estimate that at 
least 5,000 bibliographical items have 
been published in the past 70 years 
from which Stolurow chose 42. His 
selections could well be analyzed in 
terms of whether they are essentially 
systematic as opposed to experimen- 
tal articles, in terms of how well they 
represent the various problem areas 
that have been of concern to people 
interested in learning, in terms of how 
well various theoretical points of view 
are represented, and in terms of 
whether these particular items are in 
fact good models. 

_ The book is divided into eight sec- 
lions or chapters. The first of these, 
titled “Some Systematic Positions," 
contains six units representing eight 
of the 42 articles selected. These 
eight and seven others in later chap- 
ters, something over 35 per cent of the 
selections, are essentially nonexperi- 
mental. In addition, Every chapter 
and article contains some theory. 

hisis a sign of the times." Thus the 
ook is very heavily weighted in favor 
9f theory and its exposition, although 
it does contain 27 detailed research 
reports, 


Stolurow's selections are not repre- 
sentative of points of view. He re- 
ports that he based his selections on 
five points of quality and the addi- 
tional criteria that selections should 
be recent rather than old, human 
rather than animal, and S-R rather 
than gestalt. His chapter on system- 
atic positions has selections from 
Thorndike, Hull, Guthrie, Estes, and 
Skinner, representing S-R positions, 
and Tolman, who is somewhat diffi- 
cult to classify. The other systematic 
or nonexperimental articles are by 
Hull, Mowrer, Pavlov, Skinner, 
Spence, Tolman, and Woodworth, 

Aside from the S-R bias, which is 
confessed and deliberate, the nature 
of the selection process is partially 
revealed by who is omitted. There 
are, by rough count, 947 references 
cited in this list of 39 items. Thirteen 
men have ten or more references on 
this list. Stolurow has selected arti- 
cles from eight of these men, but has 
not selected any from the writings of 
Hilgard, Krech (Krechevsky in the 
bibliography), McGeoch, Maier, or 
Neal Miller. Each has published at 
least one "classic" article in the 
period from which these items were 
selected. Twenty-two were published 
between 1947 and 1950 and the re- 
maining 20 between 1928 and 1946. 

The 27 articles which report experi- 
ments in detail are heavily weignted 
in favor of experiments involving hu- 
man subjects (18) as against those 
involving animals (9). This weight- 
ing follows largely from the choices 
of subject matter represented. There 
are two experimental articles on con- 
ditioning concepts and techniques, 
four on motivation and reinforce- 
ment, six on motor and verbal learn- 
ing, three on discrimination and 
perceptual learning, two on educa- 
tional and social learning, five on re- 
tention and forgetting, and five on 


_ transfer. 
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As models of research design, there 
can be little quarrel with Stolurow's 
selections. They are some of the 
better examples of the kind of article 
usually found in the Psychological 
Review, the Journal of Experimental 
Psychology, and the Journal of Com- 
parative and Physiological Psychology, 
from which most of his selections 
were taken. They are also typical of 
the writing style of these journals 
and are good models if one wishes to 
perpetuate this style. If one wishes 
to teach a more intelligible style, 
some of these selections can be used 
as examples of what not to do. 

The book is printed by a photo- 
offset process. The right-hand mar- 
gins of the pages are not justified. 
The paper used is sufficiently trans- 
parent that printing from the back 
of the page and from the next page 
shows through. The binding is a hard 
cover, but it is covered with paper 
rather than cloth and the spine of my 

' copy is broken on both sides from the 
handling it has received in preparing 
this review. It is a shabby job of 
book making and, from this stand- 
point, overpriced by a very wide 
margin. 

Epwarp L. WALKER. 

University of Michigan. 


PsyCHOTHERAPY RESEARCH GROUP, 
PENNSYLVANIA STATE COLLEGE, 
(Wm. U. Snyder, Chairman). 
Group report of a program of re- 
search in psychotherapy. State Col- 
lege: Pennsylvania State Coll. 
Press, 1953. Pp. iii--179. $2.25. 


Regardless of divergent viewpoints 
about the merits of client-centered 
counseling, much credit is due Rogers 
and his students for their pioneering 
.and ingenious research with ver- 
batim interview recordings. This 
report is part of a sequence which 
began little more than a decade ago, 
but it already represents a second 
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generation effort, being i 
laborative production of nim 
toral students directed by 
U. Snyder. The report consi 
description of the research 
nine thesis condensations, à $i 
and discussion by Dr. Snyder, 
ography, and a detailed apper 
that provides the rating and eo 
procedures utilized in the studi 

The basic sample of cases $i 


maximum N samples were 
depending upon various selecti 
teria, e.g, number of in 
transcribability of intervie 
taken, etc. In addition to 
based upon the recorded intet 
use was made of pre- and postcou 
ing tests. These tests were th 
schach, the MMPI, and the Moor 
Problem Check List. Further 
were provided by ratings mad 
clients, counselors, and independe 
judges using such devices as a po' 
counseling client scale, a 
personality scale, and a posttherap) 
counselor check list. Somewha i 
prisingly in view of recent trend 
Q sorts were employed. j 
Among the topics investigated s 
the reasons for early dropouts, 1 
relationships between counselor 
client characteristics, the el 
ment of a composite criterion 
measuring client progress, tM 
dictability of client verbal beh 
during counseling, indices of 
ance, and comparison of the 
teristics of more and less SU 
cases. 
A number of new hypothe 
variables have been introaue 
these studies and, despite th 
that most of the findings are’ 
clusive, the report is an 1mpc 
contribution to research me 
ogy. Many readers will doub 
disturbed by the continuing < 
offered for the position that t 
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tiveness of therapy can be evaluated 
without the necessity of external cri- 
teria. 
LEONARD S. KOGAN. 
Institute of Welfare Research, 
Community Service Society of 
New York. 


STEPHENSON, W. The study of behav- 
ior. Chicago: Univer. of Chicago 
Press, 1953. Pp. ix+376. $7.50. 


Ií Stephenson had set forth his 
purposes in writing this book, which 
carries the subtitle “Q-technique and 
its methodology,” the reviewer's task 
would be easier. Our inference is 
that his aims were the following: (a) 
to challenge much of current method- 
ology in psychology, (b) to explain 
Q-methodology and (c) to show by 
illustration how it can put psychol- 
ogy's “house in scientific order," and 
(d ) to demonstrate that theory test- 
ing and scientific conclusions are pos- 
sible on the basis of a single case. 
Perhaps when the author speaks of 
the "platform upon which we are to 
campaign," he is telling us that his 
sole aim is to promote Q-methodol- 
ogy. 

We shall not attempt to list here 
all of the concepts and all of the 
people against which and whom 
Stephenson arrays himself for battle. 
He does not have any faith in ordi- 
nary factor analysis (R-methodol- 
Ogy), in measurement, in norms, in 
large samples, or in any so-called 
generalizations springing therefrom. 
He admits that he alone is “in step 
and all others out" (p. 348), but this 
does not keep him from citing what- 
ever supporting fragments he can 
find, whether these be found in the 
Writings of J. R. Kantor or of J. M. 
Keynes or of some very obscure per- 
_ Son. His sallies, courageously set 

forth, will be found either interesting 
Or irritating, according to the pro- 
divity of the reader. 


$27 


With regard to purpose b, one would 
expect that an author who complains 
because such intellects as Godfrey 
Thomson and Cyril Burt have mis- 
understood his writing would make a 
special effort at clear and concise 
exposition. Instead, we find a poorly 
organized, piecemeal presentation, 
more confusing than enlightening. 
Thus, we can anticipate continuing 
misunderstanding, and consequent 
misuse of Q-methodology. 

The second half of the book is 
devoted mainly to applications of Q- 
methodology in the areas of type psy- 
chology, questionnaire analysis, so- 
cial psychology, self-psychology, per- 
sonality, projective tests, and clinical 
psychology (a chapter to each area). 
We are told that 'Q-technique has 
its applications in almost every nook 
and cranny of psychology in its re- 
search aspects” (p. 338) and "in every 
branch of psychology where behavior 
is at issue" (p. 343). If we are to 
judge from the given illustrative ap- 
plications, the quoted claims repre- 
sent wishful thinking. 

Clinicians and others who, by 
necessity or by choice, deal with a 
single case will find comforting re- 
assurance and be motivated to read 
further by "We are to work, instead, 
with a single person, at the call of a 
theory. Yet we shall reach valid, 
scientific conclusions" (p. 5). The 
continuous stress on the merits of 
the single case leads ultimately to “In 
principle, orte may work scientifically 
for a lifetime with a single case" (p. 
343). Unfortunately, by the time 
one has spent a lifetime developing a 
set of principles for predicting (or 
explaining) every fragment of be- 
havior of a single case, the subject 
will have ceased to behave. Or 
another logical conclusion to this sort 
of thing is that psychologists must 
develop two and a half billion "'sci- 
ences” to explain the behavior of the 
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two and a half billion human inhabi- 
tants on this planet! What of the 
task of animal psychologists? 

The fact that our comments have 
been restricted to general reactions 
should not be misconstrued as indi- 
cating that the margins of the re- 
viewer's copy of the book are free of 
specific questions. Far from it. 

QuiNN McNEMAR. 

Stanford. University. 


GOLDHAMER, HERBERT, & MARSHALL 
ANDREW W. Psychosis and civiliza- 
tion. Glencoe, Ill.: The Free Press, 
1953. Pp. 126. $4.00. 


This slim volume consists of a re- 
print of two statistical studies in 
the frequency of mental disease. The 
first and more significant investiga- 
tion is concerned with an analysis of 
admissions to mental hospitals in 
Massachusetts and Oneida County, 
New York, extending back to 1840. 
Consistent with data previously re- 
ported by others for the past half 
century, the present authors found 
that for age groups under 50 there 
has been no increase in the frequency 
of psychoses over the past 100 years. 
This finding should put an end to 
the recurring myth that psychoses 
are a product of the stress and strain 
of modern life. The second paper 
presents expectancy rates of mental 
disease. It differs from earlier studies 
in that the tables prepared state the 
risk of admission to a mental hospital 
between any two points of an indi- 
vidual’s life. This method of presen- 
tation serves to accentuate the high 
incidence of admissions for the older 
age group. If a 60-year-old male sur- 
vives to age 85 he runs a 10 per cent 
risk of being admitted to a mental 
hospital. 

James D. Pace. 

Temple University. 
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FRENCH, THOMAS M. The integration 
of behavior. Vol. II. The integrative 
process in dreams. Chicago: Univer. 
of Chicago Press, 1954. Pp. xi 
+367. $6.50. 

When the history of twentieth cen- 
tury psychology is written, it will un- 
doubtedly be characterized as the 
period when two great streams of 
psychological thought, one flowing 
from the laboratory, the other from 
the clinic, converged and ran to- 
gether to form a unified science of 
dynamic psychology. The future 
historian will observe that the dialec- 
tical process which eventuated in the 
synthesis of experimental and clinical 
psychology took a long time and had 
to overcome many obstacles. Among 
the obstacles that he will discuss, one 
at least is apparent to us today, 
namely the insularity of the pro- 
ponents of each of these major orien- 
tations. This insularity prevents one 
side from communicating with the 
other, so that each remains ignorant 
of what the other stands for. 

Fortunately for psychology there 
are indications that the iron curtain 
of insularity is being lifted and that 
a real exchange and integration of 
ideas are beginning to take place. 
The name-calling era is drawing to 
a close. Psychologists brought up 1n 
the tradition of experimental psy- 
chology are reading and being insemi- 
nated by psychoanalysis, and psycho- 
analysts, though to a lesser extent, 
are reading and being inseminate 
by experimental psychology. An out- 
standing example of a psychoanalyti- 
cally trained investigator whose 
thinking has been fertilized by inter- 
course with systematic experimenta 
psychology is Thomas French, asso" 
ciate director of the Chicago Institute 
of Psychoanalysis. ; 

In a number of articles published 
during the last 20 years, Frenc 
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demonstrated his ability to synthe- 
size the two orientations. Now, in 
his impressive work in progress, three 
volumes of which are still to be pub- 
lished, we are privileged to witness 
the culmination of his integrative en- 
deavors. The present volume is an 
application of the basic postulates 
set forth in Volume I to an under- 
standing of dreams. 

Essentially, what French has done 
is to graft Tolman's cognitive theory 
onto Freud's motivational theory. 
French's key concept is cognitive 
structure, by which he means ‘a 
hierarchy of plans for achieving an 
end-goal." Implicit in this definition 
is the concept of motive since every 
plan involves striving toward a goal. 

French not only believes that every 
dream has a cognitive structure but 
also that the cognitive structures of 
different dreams of the same person 
form a pattern. The cognitive struc- 
ture of a dream is discovered by using 
various sources of evidence, namely, 
information about the dreamer, free 
associations, translation of symbols 
by functional analysis, and compari- 
sons of one dream with the other 
dreams of a series. All of this infor- 
mation is blended together by em- 
ploying the method of internal con- 
sistency, which is the favorite method 
of psychoanalytic investigators. The 
end result is a comprehensive under- 
standing of the dreamer's conflicts, 
their relative intensities and inter- 
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relationships, their historical origins 
and contemporary significance, their 
underlying physiological and motiva- 
tional patterns, the dreamer's plans 
and capacities for resolving them, 
and his hopes of success. 

The rhetorical strategy of the book 
consists of discussing the dreams of 
a patient who is undergoing the type 
of psychoanalytic treatment prac- 
ticed at the Chicago Institute of Psy- 
choanalysis. Some readers may ques- 
tion whether the strategy is a success- 
ful one. This reader found that it be- 
came quite tedious to try to follow 
and bear in mind all of the intricacies 
of analyzing the dream series. It 
may well be that every idea is related 
in some way to every other idea, and 
that there is almost an infinity of 
components and levels in the cogni- 
tive structure of a particular human 
mind, but I wonder whether there is 
not a better way to get this across. 
Perhaps not. It seemed to me that 
there was an unconscionable amount 
of redundancy and that more severe 
editing could have made the book 
more readable. French's use of dia- 
grams is an aid to quick understand- 
ing and should have enabled him to 
abbreviate the extended discussions. 

In spite of these literary defects 
the book is a solid contribution to the 
major task of twentieth century psy- 
chology. 

CarvIN S. HALL. 

Western Reserve Ui niversity. 
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RESEARCH ON SENSORY INTERACTION IN THE 
SOVIET UNION! 
IVAN D. LONDON 
Brooklyn College 


Since the early thirties the general 
area of sensory interaction has been 
the object of considerable attention 
by researchers in the Soviet Union.! 
However, owing to the language 
barrier and the near unavailability 
of the literature, most of the experi- 
mental and theoretical work that has 
been done in this field has gone un- 
noticed. Perhaps another factor tend- 
ing to keep western attention averted 
has been the discouragement induced 
by the difficulty encountered in at- 
tempting to duplicate the reported 
findings. 

It is true that much of the Soviet 
Work on sensory interaction adheres 
to standards of execution, reportage, 
and interpretation that would be 
quite unacceptable to the western 
researcher. Asa matter of fact, even 
à casual survey of the Soviet litera- 
ture yields ready evidence of inade- 
quate instrumentation and method- 

‘Writing of this paper was initiated in 
1952 in the course of association with the Har- 
vard Project on the Soviet Social System, AF 
No. 33 (038) 12909, Russian Research Cen- 
ter, Harvard University. The writer grate- 
fully acknowledges the subsequent support 
of the Russian Research Center. 

? Some western students of Soviet psychol- 
ogy have seen in this the hand of Marxist 
ideology (13, pp. 161-162). The real reasons 
or the concentration of research in this 
area, however, are probably more on the 
Pedestrian side. The writer discusses them in 
4 paper to be published elsewhere. 


ology, scanty detail, and a primitive- 
ness in the statistical treatment of 
data which makes anything beyond 
an arithmetic mean a rare encoun- 
ter.* 

Nevertheless, western work on 
sensory interaction has been, in the 
main, scattered and desultory, where- 
as in the Soviet Union the subject 
has been given systematic and sus-' 
tained attention (172, p. 77). For 
this reason alone a survey of the 
claimed experimental findings and of 
the correlative theorization seems to 
be in order. 

The writer proposes first to outline 
representative basic findings, then 
to discuss the conditions that are 
said to affect interaction among the 
sense organs, and finally to sketch 
the theory that has been developed 
to account for this interaction. By 


3 The published reports and resolutions of 
various professional meetings (not to be con- 
fused with the stage-managed affairs of the 
Lysenko variety), along with the abstracts of 
the discussions following the reading of papers, 
provide a frank admission of some of 
deficiencies (5; 15, p. 805; 59; 109; 116-118; 
172, p. 55; 198, p. 387; 347; 382; 484; 485; 
486, p. 249; 487, p. 157; 488, p. 623; 489, 
pp. 650-651; 492, p. 715; 495, p. 330). Re- 
cently standard deviations and coefficients of 
correlation have been making a spotty reap- 
pearance after their exit from the psychologi- 
cal scene in 1936. Tests of significance are still 
only occasionally encountered (13; 88; 197, p. 
45; 257, pp. 79-80; 335; 339, p. 402). 
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research in sensory interaction* will 
be meant generally investigations 
that explore modifications of response 
in one sense organ under direct 
stimulation when another sense organ 
has been or is subject to its own char- 
acteristic stimulus. The latter stimu- 
lus is referred to in the Soviet litera- 
ture as "indirect" (nepriamy?) or 
“accessory” (pobochnyt)—the last be- 
ing the preferred term in this paper. 
Excluded for the most part from the 
survey will be intersensory research 
on (a) sense organs in pathological 
states (12, 75, 77, 153, 196, 475—477, 
479); (b) nonsensory reactions (elec- 
trical and ionic excepted) of the sense 
organs;* (c) binaural interaction (1, 
2, 197); (d) binocular and other intra- 
modal interaction (pertinent excep- 
tions being made, however) ;? (e) the 
effect of pain, as the accessory stimu- 
lus, on sensory response (375, 393, 
474); and (f) the influence of intero- 
ceptive stimulation and dermal ir- 
radiation (62, 63, 135, 162, 179, 185, 
187, 191, 314, 315, 317, 321, 355). 
The writer wishes it to be understood 
that all statements of ostensible fact 
in the paper refer to data or inter- 
pretation of data generally accepted 
as valid in the Soviet literature. The 
survey covers the period from 1930 
to the present. 


CHANGES IN SENSORY RESPONSE 
UNDER ACCESSORY STIMULATION 


Absolute Sensitivity of Peripheral 
Vision 
The absolute sensitivity of periph- 
eral vision was early shown to under- 


4 Literally, interaction of the sense organs 
(vzaimodetstvie organov chuvstv). 

5 E.g., modifications of intraocular pressure, 
angioscotomata, etc. (384-388, 475-478). 

* E.g., binocular contrast, various proc- 
esses of retinal induction and sensibilization, 
simultaneous olfactory stimulations, etc. 
(3; 43-47; 48, pp. 209-215; 49; 53; 54; 111; 
113; 128; 194; 195; 274, pp. 258-259; 451; 
481). 
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go modification with excitation of the 
nonvisual organs (34). Modifiability 
of peripheral sensitivity was demon- 
strated also with excitation of the 
central visual system which, together 
with the peripheral system, consti- 
tutes the visual apparatus. Thus, il- 
lumination of the macular field was 
found to reduce sensitivity of the 
peripherally placed rods shortly after 
cessation of the stimulus (68, 73, 75, 
93, 172, 286, 294, 326, 402, 403). 
The antagonistic relationship sur- 
mised to obtain between the central 
and peripheral retina has been the 
subject of considerable study with 
results confirming and extending 
earlier findings (325, 329, 357-359, 
403, 480). For example, it was noted 
that illumination of the central 
retina has no effect on the retinal 
periphery in the color blind, whereas, 
in those whose cone apparatus is non- 
defective, characteristic modifications - 
of peripheral sensitivity are to be 
observed (265, p. 30; 357; 358). 
Thus, the reciprocal nature of the 
cone-rod relationship, which Orbeli 
had earlier hypothesized, came to be 
an accepted fact (69; 233, p. 413; 
256, p. 47; 285; 371; 436-438). 
While macular illumination, after 
its cessation, induces a reduction 1n 
peripheral sensitivity, a subsequent 
increase was found. ‘Thus, aif 
macular illumination with moderate- 
ly bright light for one to ten minutes, 
a decrease in peripheral sensitivity 
for the first few minutes is exhibite " 
followed, however, by à period E | 
supernormality (286-288, 400, 401). 


1 The first phase is considered as an in 
stance of inhibition; the second, as one i 
disinhibition. However, “sensibilization . 
is not considered to be derived always aho 
disinhibitory processes, Thus, the — 
action of threshold stimuli upon periP el ral 
vision is ascribed to the increase © pes 
excitability caused by weak excitation Sol 
visual apparatus (274, pp. 129-130, 185- 
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Not only does macular illumination 
of one eye effect a reduction in the 
sensitivity of its retinal periphery, but 
a diminution is to be observed like- 
wise in the companion eye. Further 
study, however, revealed that more or 
less prolonged adaptation of one eye 
to moderately bright light results in a 
heightened peripheral sensitivity in 
the other, while increased central 
sensitivity is in turn reported sub- 
sequent to peripheral illumination. It 
was also established that preliminary 
adaptation of one eye to red light of 
low intensity heightens very marked- 
ly the peripheral sensitivity of both 
eyes (85-88; 117; 120; 143; 209; 
216-218; 274, pp. 182-186; 286; 288; 
289; 291; 292; 400; 402; 444; 472; 
475). 

Pronounced effects on the peripher- 
al sensitivity of the visual organ have 
been demonstrated as a consequence 
of auditory stimulation (32, 78, 82, 
137, 161, 178, 180, 183, 342-344, 345, 
405, 446). For example, it has been 
Shown that peripheral sensitivity, as 
a rule, declines on exposure to sounds 
of average or above average inten- 
sity. According to report, the noise 
9f an airplane motor actually drops 
peripheral sensitivity to as low as 20 
per cent of the level obtaining under 
conditions of quiet (32). However, a 
brief period of hyperventilation is 
held to restore peripheral sensitivity 
In spite of continued auditory action 
(94; 172, p. 118; 295; 296). Instances 
of heightened peripheral sensitivity 
have also been claimed (304, 319, 
345), but later investigators have at- 
tributed these departures from the 
general rule to deviations from the 
Subject's normal physiological state 
(137; 188; 274, pp. 186-187). How- 
ever, exposure to ultrasonic frequen- 
cies (e.g. 32,800 cycles/sec.) has 

een reported to increase peripheral 
Sensitivity (188). 
As in the case of macular stimula- 
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tion, the same two-phase sequence of 
initial depression and subsequent ele- 
vation of peripheral sensitivity above 
normal—the latter phase a rather 
prolonged one—is to be discerned 
with auditory stimulation. This is 
true also when ultrasonic frequencies 
are employed (188; 274, p. 190). 

Olfactory stimuli of various kinds 
have been observed to influence the 
level of peripheral sensitivity of the 
visual organ. The odor of bergamot 
oil and of pyridine in toluol, for exam- 
ple, are held to heighten peripheral 
sensitivity, although, where there 
were strong negative reactions to the 
latter, instances of lowered sensitivity 
are reported. Spirits of hartshorn are 
also said to increase peripheral sensi- 
tivity (48, pp. 281-286; 161; 183; 
260; 274, p. 188; 345). 

Similarly, gustatory stimuli of 
various kinds have been observed to 
influence the level of peripheral sensi- 
tivity of the visual organ. Weak 
sweet, salt, and acid solutions bring 
about an increase in peripheral sensi- 
tivity, whereas quinine brings on a 
decrease. The latter effect is, as 
above, attributed to the negative 
reaction of the subject to the bitter- 
ness of quinine (161, 186, 345). 

Stimulation of the cold receptors 
is reported to produce a marked in- 
crease in peripheral sensitivity of the 
visual organ. For example, one 
minute after facial application of 
cool water, peripheral thresholds drop 
markedly. On the other hand, 
stimulation of the warmth receptors 
has a reverse effect, raising peripheral 
thresholds (66; 76; 172, p. 91; 183). 

The effect of vestibular, intero-, 
and proprioceptive stimulation on 
peripheral sensitivity has been the 
subject of a number of studies. In- 
teroceptive stimulation, arising from 
uteral expansion and distension of the 
bladder, is held to affect peripheral 
sensitivity adversely (172, pp. 109- 
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110; 185; 190; 191). Proprioceptive 
stimulation, evoked by muscular 
activity, is reported to induce varying 
effects: after light muscular activity, 
such as finger tapping at maximal 
rate or two minutes of gymnastic drill, 
an increase in peripheral sensitivity 
is noted; after heavy, fatiguing ac- 
tivity, however, à decrease ensues 
(34—40; 89; 91; 159; 172, pp. 91, 110- 
113; 185; 433). Postural stimuli 
exert varying effects on peripheral 
sensitivity of the visual organ: maxi- 
mal sensitivity is elicited when the 
subject is comfortably seated; for the 
standing position it is considerably 
depressed (172, p. 118; 267). Vestib- 
ular stimulation—írom rotation of a 
seated subject five or ten times— 
produces a significant decrease in 
peripheral sensitivity, with return to 
normality requiring periods ranging 
from 5 to 30 minutes (14, 96). 


Dark Adaptation 


Not only is peripheral sensitivity 
affected by various types of acces- 
sory stimulation, but so is the rate 
of dark adaptation. An accelerating 
influence on the process of dark adap- 
tation has been reported from acces- 
sory stimulation of the cold, taste, 
and proprioceptive receptors; a simi- 
lar effect upon accessory illumination 
of the eye with red light has also been 
reported (67; 157; 172, pp. 53-61; 
288; 313; 316; 319; 401; 414). 


Absolute Sensitivity of Central Vision 


Considerable inquiry has been 
made into the effect of accessory 
stimulation on central visual sensi- 
tivity. From the research that has 
been done on this problem, it has 
been ascertained that auditory stimu- 
lation of moderate intensity height- 
ens central sensitivity to white light 
for the dark-adapted eye, but that 
if monochromatic light is employed, 
the effect varies with the wave length 
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utilized. Thus, central sensitivity of 
the dark-adapted eye to blue-green 
colors is raised through auditory 
stimulation, whereas, to orange-red, 
it is decreased. Central sensitivity to 
extreme spectral red and violet, as 
well as to the yellow portion of the 
spectrum near 570 mu, remains un- 
changed however (172, p. 118; 225; 
227-234; 244; 250; 404).* 

Kravkov, one of the foremost 
Soviet researchers in the field of 
sensory interaction, was able not 
only to demonstrate these effects 
over a considerable range of loudness, 
but also to show that the degree 
of effect varied directly with intensity 
of the accessory stimulus (244; 250; 
274, pp. 270-273). He ascertained, 
further, that the observed changes in 
central sensitivity of the visual organ 
were a function of not only the inten- 
sity of the auditory stimulus but also 
of its duration. Thus, in an experi- 
ment to gauge the effect of sound on 
color sensitivity to green light, Krav- 
kov found no effect from an auditory 
presentation of one-minute duration; 
however, he did note a pronoun 
effect on color sensitivity to green 
light following three minutes of audi- 
tory action. 

The opposed character of color 
sensitivity to green and to red lights 
which is revealed when employing 
sound as accessory stimulus 15 dis- 
closed also in experiments not involv- 
ing auditory stimulation (263, pP 
35-39). For example, when the h 
is tilted back (postural acc 
stimulation), it has been foun tha 
color sensitivity to green light (5 


t 


$ Reverse accessory effects have i c 
o 


noted on the central sensitivity of one € 
when the other has undergone adaptation 
red or green light. Adaptation of one eye 
green light is reported to depress d 
sensitivity in the other upon acces y gt 
tory action; adaptation to red light to 

it (143). 
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my) is always markedly decreased, 
proceeding after 90 minutes to 25 
per cent of its original value, whereas 
color sensitivity to orange-red light 
(610 mu) exhibits an occasional tend- 
ency toward increase (267). 

The direction of these effects is 
the reverse of that elicited with sound 
as the accessory stimulus. However, 
chilling, thermal, and gustatory ac- 
cessory stimuli have been reported to 
induce directional effects similar to 
those elicited by auditory accessory 
stimuli. Here again, effects in the 
yellow region and the spectral ex- 
tremes are not to be observed (74; 76; 
245; 251; 256, pp. 48-49; 424). 

The opposed character of color 
sensitivity to green and red lights is 
demonstrated more fundamentally 
by experiments of a quite different 
type. Kravkov and Galochkina, for 
example, were able to demonstrate 
the opposing modifications of sensi- 
tivity to green and red colors by pass- 
ing a weak electric current through 
the eye with strengths of current 
ranging from 0.02 Ma. to 0.5 Ma. 
and times of action up to 15 minutes. 
Where anodal eye contact was em- 
ployed, color sensitivity of the dark- 
adapted eye was increased for green- 
blue illumination, but decreased for 
orange-red. On employing cathodal 
eye contact, the effects were reversed. 
No change was noted as regards the 
extreme ends and yellow region of the 
spectrum (274, pp. 281-282; 280; 
283). The effect of auditory stimu- 
lation on central sensitivity of the 
eye was thereby shown to be similar 
to that resulting from anelectrotonic 
influence—a fact to which consider- 
able significance is attached. 


Differential Sensitivity to Brightness 


Several experiments have been 
performed which produced evidence 
of an effect of accessory stimulation 
on the differential sensitivity of the 
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eye to brightness (70, pp. 216-220; 
98; 99; 274, pp. 216-220; 285; 292; 
293). Illumination of one eye, it 
seems, makes for a simultaneous de- 
crease in differential sensitivity of 
the other eye. Furthermore, the de- 
crease is greater, the brighter the 
white-lighted field on which the lat- 
ter is focused. The same effect can 
be obtained by keeping the bright- 
ness of the test field constant and in- 
creasing the brightness of accessory 
illumination (85, 86, 88, 216, 218). 

Similar effects are elicited with 
sound as accessory stimulus. Thus, 
the brighter the viewed field, the 
greater the decrease in differential 
sensitivity under the effect of simul- 
taneous auditory stimulation (274, 
p. 219); and very loud noise, such as 
that produced by an airplane motor, 
impairs markedly general brightness 
discrimination (32). 

Accessory stimulation of the pe- 
ripheral and central areas of the 
retina has also been shown to affect 
subsequent differential sensitivities. 
Thus, weak illumination separately 
of the peripherally distributed rods 
and cones acts to increase respec- 
tively the differential sensitivities 
of the centrally distributed cones and 
rods, while adversely affecting re- 
spectively those of their centrally dis- 
tributed counterparts (274, pp. 220- 
221; 291-294). 


Critical Flicker Frequency 

Sensory interactional effects are 
evidenced also in the results of stud- 
ies on critical flicker frequency (c.f.f.) 
under the influence of accessory stim- 
ulation (21; 76; 219; 220; 246; 274, 
pp. 357-370).° Kravkov, for example, 
noted that illumination of one eye 
with moderately bright light raises 

? As was to be expected, accessorily in- 
duced changes in photic sensitivities parallel 
those noted in c.f.f., but with opposite sign 
(274, p. 365). j 
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c.f.f. in the other, but wards off the 
reduction of c.f.f. that is usually ob- 
served in central vision of the other 
eye in the course of dark adaptation 
(274, p. 363). 

It was demonstrated, in addition, 
that the effects of auditory stimula- 
tion on c.f.f. depend on the mono- 
chromatic nature of the light used. 
Thus, during the time of auditory ac- 
tion c.f.f. for green light (520 my) 
is reduced, whereas for orange-red 
(630 my), c.f.f. is raised. Here cen- 
tral retinal illumination was of such 
brightness as to produce critical flick- 
er frequencies of 12-20 flashes per 
second. Accessory stimulation was 
pitched at 800 cycles at 85 db. On 
employment of white light as the 
primary stimulus, accessory auditory 
action was likewise found to affect 
c.f.f.: for central vision it is height- 
ened, for peripheral vision lowered 
(220; 246; 256, pp. 49-50; 274, pp. 
364—365). 

Experimenters, employing the 
odors of bergamot oil and geraniol, 
ascertained the effect of olfactory 
stimulation on c.f.f. (251; 274, pp. 
273-215). With brightness of flicker 
acting upon the central retina so as 
to produce a c.f.f. of 12-18 flashes 
per second for an eye at a given level 
of dark adaptation, the reduction 
of c.f.f. for green-blue illumination 
and its increase for orange-red under 
olfactory stimulation were estab- 
lished. No effects were noted for 
yellow (around 570 my) or for extreme 
spectral red (690 my) and violet 
(425 my). Furthermore, upon cessa- 
tion of accessory action, it was found 
that c.f.f. returned not only to pre- 
vious levels, but having done so, fre- 
quently proceeded to exhibit relative 
Írequencies the reverse of those pre- 
viously obtained under olfactory in- 
fluence. A similar picture of change 
in c.f.f. has also been disclosed upon 
accessory application of gustatory 
and thermal stimuli (76). 
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Reverse effects have been 
by varying the intensity of ligh 
ployed. Thus, if brighter light 
be employed to bring the 
sponding c.f.f. to 26-30 flashes” 
second, then under the action of 
same accessory stimulation c.f.f. may 
change in the opposite direction to 
that obtaining for brightness marked - 
by a c.f.f. of 12-18 flashes per second. 
In other words, if c.f.f. for weak l 
is reduced by accessory stim 
then much brighter light may um 
the same conditions increase it (274, 
pp. 367-368). ; 


Visual Acuity and Irradiation 


A number of studies have 
devoted to research on the effec 
accessory stimulation on visual 
ity (119; 209; 210; 214; 215; 
472; 256, p. 50; 265, pp. 
276-278). These show, for exan 
that where closely set, black 
are to be distinguished from | 
other against a white backg 
visual acuity is sharpened under 
cessory stimulation; but where 
shades are interchanged, it is dim 
ished. Of interest also is the fact th 
upon monaural stimulation, espt 
ly marked effects are to be noted € 
the visual acuity of the contralate 
eye (274, p. 187). 

In irradiational terms One 
speak of the increase of irradi 
effect under accessory stim! 
Visual acuity in the above case 
then be thought of as enhance 
cause, through increased irradia 
effect under accessory stim 
the white interspaces between 
black forms crowded on a 
background are rendered larger * 
the black forms are consequen 
ter brought out. Where the | 
are interchanged, the redu 
the black interspaces under í 
sory stimulation would explai 
resulting diminution of visual ac 
(214, 215, 276-278). 
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Whether the accessory stimulation 
be through light, sound, or odor, the 
increase of irradiational effect for 
white is demonstrable in every case. 
For example, increase of irradiational 
effect is induced by the odor of berga- 
mot oil (214, 251), and under ac- 
cessory auditory stimulation irradia- 
tional effect for orange-red increases 
(267, p. 34)—a finding which might 
not have been anticipated in view of 
the reputed decline of color sensitivity 
to orange-red under auditory stimu- 
lation. This fact is considered to be of 
importance in surmising a probable 
mechanism behind the action of ac- 
cessory stimuli on visual afferentia- 
tion, and we will return to it later. 


Electrical Sensitivity of the Eye 


Electrical sensitivity of the eye!^ 
has been shown to be modifiable by 
the action of accessory stimuli (30, 
pp. 162-164; 70, p. 188; 94; 139; 410; 
412; 415; 462; 463; 479). For exam- 
ple, illumination of one eye always 
raises the electrical sensitivity of the 
other. Auditory stimulation likewise 
increases electrical sensitivity, pro- 
vided the sounds employed are not 
extremely loud, in which case elec- 
trical sensitivity is decreased (20, 30, 
130, 132). Gustatory stimulation 
produces varying effects on the elec- 
trical sensitivity of the eye. Where 
sugar furnishes the accessory stimu- 
lus, electrical sensitivity is increased; 
where the stimulus is salt or citric 
acid, it is reduced. Under the olfac- 
tory action of bergamot oil increase 
in threshold of electrical sensitivity is 
reported to result (71, 72). 


Color Zones 
The boundaries of color zones have 


© Electrical sensitivity of the eye is in- 
versely indexed by the minimum strength of 
the applied current necessary to induce visual 
sensation (phosphene) on closing and breaking 
the circuit. Thus, the weaker the current 
inducing phosphene, the higher the electrical 
Sensitivity of the eye. 
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been reported to shift under acces- 
sory stimulation. Thus, with lateral 
stimulation of the retina with bright 
light expansion of the red and yellow 
color zones and contraction of the 
green and blue zones result (105), 
With sound as accessory stimulus the 
zonal boundaries for green and blue 
are enlarged, those for orange-red 
drawn in, while those for extreme red 
remain as they are (144-146)—ef- 
fects which are reminiscent of the 
changes, previously described, in 
color sensitivity brought on through 
accessory auditory stimulation. 

Olfactory stimuli also have effects 
on the zonal boundaries (48, p. 281). 
If the odors of rosemary and indol 
are employed, expansion of the green 
region results with the former and its 
contraction with the latter. As re- 
gards the red zone, rosemary is 
found to contract, but indol to ex- 
pand it—effects the reverse of those 
above. However, alteration of red 
zonal boundaries is not always ob- 
served (274, p. 275; 398; 399, pp- 
389-396). 


Contrast 


Changes in visual contrast effects 
under the influence of auditory stimu- 
lation have also been the subject of 
investigation (446). The effects vary 
with the degree of contrast obtaining 
prior to accessory action. Thus, if 
a gray circular area on a white back- 
ground is presented with a tone of 800 
cycles at 75 db as accessory stimulus, 
contrast is heightened if the initial 
difference of brightness between the 
gray circle and white background is 
in itself great. On the other hand, 
contrast is diminished if the difference 
is initially small. 


Afterimages 

Accessory auditory stimulation has 
been shown to affect the course of 
extinction of visual afterimages (363, 
364, 4690-472). In a number of 
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people, for example, a strong auditory 
stimulus heightens the brightness of 
the afterimage, though hastening the 
termination of its course. Here the 
accessory stimulus precedes the incep- 
tion of retinal illumination and is 
continued until extinction of the re- 
sulting afterimage. Sounds that are 
powerful enough to evoke fairly in- 
tense auditory afterimages tend es- 
pecially to heighten the brightness of 
the visual afterimage. However, 
should an extremely loud tone be 
presented after the cessation of reti- 
nal illumination, the afterimage un- 
dergoes temporarily complete dis- 
appearance. 


Auditory Sensitivity 


While most research has employed 
light as the primary stimulus, a 
number of studies have been under- 
taken with sound performing the 
role. Here again effects of accessory 
stimulation have been demonstrated. 
Illumination of the eyes with white 
light, for example, is claimed to in- 
crease auditory sensitivity; absence 
of visual stimulation, on the contrary, 
decreases it. However, opposed ef- 
fects are obtained by using, as ac- 
cessory stimuli, different monochro- 
matic lights. Thus, illumination of a 
white room with green light increases 
auditory sensitivity; but with red 
light it is decreased (182; 202; 312, 
p. 155; 351; 429). 

Auditory thresholds are lowered 
also when the odors derived from 
geraniol and benzol supply the ac- 
cessory stimulation (48, p. 281; 134; 
267; 379; 380).  Postural stimuli 
likewise are reported to affect audi- 
tory sensitivity. Thus, on tilting the 
head backward for any length of time 
auditory sensitivity is found to suffer. 
It is also reported that higher audi- 
tory sensitivity is obtained in the 
seated position than in either the 
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standing or horizontal position (| 
p. 118). 

Gustatory stimulation has 
reported to improve auditory 
tion of low tones, but to worsen sor 
what that of high tones. Estab 
also have been the facilitating 
on hearing of accessory stimulatiom 
the cold and proprioceptive ree 
tors (172, p. 82; 366). 


Miscellany 


Comparatively little work 
been done where the primary sti 
lus has not been visual or audite 
(327, p. 763). But what work 
been done shows the presence of 
cessory effects. Thus, it is avem 
illumination of the eyes with wh 
light heightens gustatory and olfa 
tory sensitivity (48, pp. 253-25 
whereas auditory stimulation redu 
the electrical sensitivity of the gu 
tory apparatus (194; 267, p. 
Electrical sensitivity of the gustate 
apparatus, as well as its sensitivity! 
salt solution, is reported also to d 
crease in the course of dark ada } 
tion of the eye. Mental activit 
however, excludes the latter ete 
(48, pp. 140-141; 71; 72; 457). - 

Research on the accessory €m 
of the odors of geraniol and thym 
on the vestibular apparatus disclos 
reverse effects. The odor of gerani 
heightens vestibular excitability, P! 
longs rotary postnystagmus, lengi 
ens the illusion of counterrotat 
and shortens chronaxie, while that 
thymol has the opposite effect ( 
pp. 281—282; 150; 151). : 


Some CONDITIONS AFFECTING 
SENSORY INTERACTION —.— 


Strength of Accessory Stimulus 
acc 


Increase in strength of the P 
sory stimulus leads so frequen y 
effects that are the reverse Of © 


induced by weaker intensities t 
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one can almost speak of a "rule of 
inversion". (67; 111; 161; 172, pp. 
80, 88; 178; 184; 226; 263, pp. 39- 
41; 274, pp. 180-181, 190; 344; 448- 
451; 474). For example, sounds of 
weak intensity heighten the electri- 
cal sensitivity of the eye, while those 
of increasing intensity are accom- 
panied by a gradual decline. How- 
ever, Kravkov doubts the generality 
of any “rule of inversion" and points 
to one case where the rule itself is 
controverted. Varying the intensities 
of sounds employed as accessory 
stimulation, he found that an in- 
crease from 25 db to 95 db was paral- 
leled by a steady increase of sensi- 
tivity to green light (530 mu) and a 
steady decrease to orange light 
(590 mu)—findings which deny a 
general "rule of inversion" (267, p. 
93; 274, p. 180). 


Excitatory State of Primary Sense 
Organ 


The effect of accessory stimulation 
depends not only on the intensity 
of the accessory stimulus, but also on 
the initial degree of excitation of the 
sense organ undergoing primary stim- 
ulation. Thus, in order to increase 
the loudness of a tone on accessory 
visual stimulation, the initial loud- 
ness must be sufficiently great. Other- 
wise, either no effect will be demon- 
strable or a loss in loudness will 
ensue. The example cited under Con- 
trast in the previous section is an- 
other instance of the effect of initial 
degree of primary excitation. 


Duration of Accessory Stimulation 


_ Intensity of accessory stimulation 
is not the only factor of importance; 
duration is also important. Kravkov 
(130-132, 184, 344, 348) has shown 
that, as a rule, accessory effect be- 
comes increasingly prominent during 
the first moments of accessory action. 
But subsequently the effect very of- 
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ten away from the maximum 
attained (274, p. 186). For example, 
auditory stimulation was 
applied for a half hour while the eye 
underwent simultaneously 31 to 61 
minutes of dark adaptation, Critical 
flicker frequency was found during 
this period to rise sharply, but to 
diminish after a quarter hour from 
the maximum previously attained 
(220). 
Termination of Accessory Stimulation 

In describing the influence of ac- 
cessory stimulation on à given sense 
organ, it is, of course, necessary to 
stipulate whether the state of the 
latter is tested before or after cessa- 
tion of accessory stimulation, since, 
after withdrawal of the accessory 
stimulus, its effect does not just 
wear off with an eventual return to 
normalcy. There frequently arises 
instead a shift in a direction opposite 
to that of the initial effect. 
sensitivity may be followed by a 
supernormal phase, heightened sensi- 
tivity by a subnormal (157; 274, p- 
190). 

For example, Kravkov observed 
that sound lowers c.f.f. for peripheral 
vision. After its termination, how- 
ever, he noted that under the same 
experimental conditions cff. is 
raised, sometimes markedly (220; 
274, p. 364). As a further illustra- 
tion, another experiment may be 
cited (405). Here, on testing the pe- 
ripheral sensitivity of the eye after 
cessation of auditory stimulation, it 
was observed that a supermo 
phase emerged subsequent to the de- 
crease generally registered during the 
action of the accessory stimulus. 


Affectivity of Stimulus 

It appears that stimulus affectivity 
is a factor to be reckoned with in 
sensory interaction. For example, it 
has been claimed that, where gusta- 
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tory and olfactory stimuli of unpleas- 
ant character are applied as accessory 
agents, peripheral sensitivity of the 
eye declines (172, p. 88; 345). It is 
also reported that, after presentation 
of harmonious tones, color sensitivity 
of the dark-adapted eye to red and 
yellow is raised, to green-blue low- 
ered, whereas disagreeable sounds 
reverse the effects (259, 426, 428). 


Physiological State 


The specific physiological state of 
the sense organs, as well as the more 
general physiological state of the 
individual, plays a role in sensory 
interaction (274, p. 169; 399, pp. 412— 
421; 455). A number of experiments 
may be mentioned by way of exam- 
ple. In a previously cited experiment, 
it was shown that presence of a sub- 
ject in a green-illuminated white room 
heightens auditory sensitivity, where- 
as presence in a red-illuminated white 
room brings about a decline. How- 
ever, on changing the physiological 
state of the subject through moder- 
ate dosage of veronal, the experimen- 
tal effects are precisely reversed. In 
the hyposthenic state induced by 
veronal, green illumination lowers, 
red illumination heightens auditory 
sensitivity (189, 429). 

Similarly, it has been demonstrated 
that with intense auditory stimula- 
tion the electrical sensitivity of the 
light-adapted eye undergoes decrease, 
and that of the dark-adapted eye, 
increase (20; 139; 274, p. 188; 303). 
Electrotonic stimulation of the eye 
with anodal contact thereon (anelec- 
trotonus) induces similar effects; 
with cathodal contact (catelectroto- 
nus), however, reverse effects show up 
(274, p. 190; 411). 

Instillation of adrenalin into the 


"In various hypodynamic states, for exam- 
ple,the more usual effects of accessory stimu- 
lation tend to sharper expression (172, pp. 
97-117). 
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eye has likewise been shown to bi 
on similar modifications in the 
trical sensitivity of the eye: for 
dark-adapted eye, electrical 
tivity declines; for the light 
it increases (106, 303). Si 
various accessory effects on r 
and color sensitivity can be made to 
reverse sign depending on whether 
the tested eye is light- or dark- 
adapted or whether it is adapted to 
green or red light (235, 240, 249, 
268, 298, 415). e£ 


Miscellany :z 


Summation of accessory eff 
each of two sense modalities, on 
cessory stimulation, induces am 
crease in peripheral sensitivity of the 
eye, their joint action induces am 
effect beyond that induced singly. 
If, however, the individual effects — 
are in opposite directions, their joint 
action induces a general effect on 
peripheral sensitivity equal approxi- - 
mately to their difference (172, pp. - 
80-81; 181). E 

Repetition of accessory stimulation. — 
Constant repetition over equal inter- - 
vals of time of the same accessory - 
stimulus has been found frequently 
to yield a diminishing effect w 
tends to zero (172, p. 81). 

Cumulation of accessory effi 
The effects of accessory stimula’ 
linger in some instances so that : 
subsequent days the change in leve 
of primary threshold is found to 
dergo successive increase (172, 
81). 


paratus exhibit an inverse reci 
of accessory influence that is diurr 
dependent. Thus, it has been sh 
that at midday, when central 
tivity is at its maximum, periph 
sensitivity is minimal. As am 
relation, degree of sensitivity í 
one apparatus is inversely tied. 
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degree of sensitivity of the other 
(431, 436-438). 


Some THEORY ON SENSORY 
INTERACTION 


In the attempt to comprehend the 
nature of the modifications of sensory 
response induced through accessory 
stimulus action, several considera- 
tions have been put to theoretical 
work. The result to date appears 
more a collection of eclectic plausibil- 
ities than an integrated set of rele- 
vant explanations. The following dis- 
cussion will report what these con- 
siderations are. 


Contiguity 


Over a considerable range of audi- 
tory and visual stimuli certain visual 
and auditory responses undergo in- 
tensification through respective ac- 
cessory reinforcement via intermodal 
action. To explain a number of them, 
such as the increase in visual acuity 
of one eye upon monaural stimula- 
tion of the contralateral ear, recourse 
has been had to the neurological map. 
In the region of the corpora quadrige- 
mina, for example, the visual and 
auditory nerve fibers are in close 
proximity and myelin-free. Since 
nerve action in one unmyelinated 
fiber is held likely to influence that 
of another contiguous to it, the feasi- 
bility of excitatory irradiation 
through contiguity of the conducting 

ts is considered to be demon- 
strated. The fact and feasibility of 
excitatory irradiation in the region of 
the corpora quadrigemina and also of 
the corpora geniculata seem to be 
Justified, in the main, by reference to 
foreign work (6; 121; 152; 156; 265, 
P. 29; 274, pp. 187-188; 304; 319). 

Similar considerations are thought 
to make olfactory influence on vision 
understandable, for in the region of 
the stratum zonale and nucleus anterior 
thalami the visual and olfactory 


51 


nerve fibers lie in close proximity. 
Thus, the olfactory afferent system 
may likewise have its effect on visual 
phenomena through excitatory ir- 
radiation. 

The fact that there is anatomical 
proximity along the auditory and 
visual nerve pathways is also thought 
to make explicable many cases of 
synesthesia. On this basis the latter 
can be viewed as instances of sensory 
interaction (267, pp. 59-63). 
Neural Excitation 

The question arises as to whether 
it is better to view many instances of 
accessory effect as mediable through 
change in neural excitation rather 
than through change in neural excita- 
bility, that is to say, through thresh- 
old modification, although either 
alternative could conceivably ac- 
count for a number of the intensifying 
effects produced by accessory stimu- 
lation. Kravkov and others, as a mat- 
ter of fact, designed a number of 
experiments to throw light on just 
this point (265, pp. 26-29; 274, pp. 
222-230). 

It had been established earlier 
that the sensitivity of central vision 
to orange-red light declines with 
simultaneous auditory stimulation; 
that is to say, a rise of threshold or 
decrease of excitability takes place. 
Kravkov reasoned that, if on acces- 
sory auditory stimulation positive 
irradiation could, nevertheless, be 
demonstrated to intensify for an ob- 
ject of orange-red hue (if its apparent 
size was increased), then change not 
in neural excitability, but in neural 
excitation must be decisive for 
achieving the observed effects. If 
positive irradiation decreased, then 
change in neural excitability could 
be taken to operate as the major 
factor. 

Experiment realized the former 
contingency—the effect of positive 
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irradiation was increased, a result 
leading to the conclusion that acces- 
sory auditory stimulation may add 
to visual excitation of certain kinds 
(257). Thus, on this basis, the data, 
relevant to both positive irradiational 
effects and visual acuity on acces- 
sory auditory stimulation, are held 
explicable. 

Further support for the emergence 
of an accessory effect through change 
in neural excitation is derived from 
the fact that with accessory auditory 
stimulation increase of differential 
threshold is not accompanied by a 
seemingly necessary decrease of ir- 
radiational effect. It is held that this 
failure of prediction is also accounted 
for on the assumption of an acces- 
sorily induced upward modification 
of neural excitation which renders 
AC/C greater in magnitude than 
AI/I, of which it is the “central 
physiological correlate... C and 
AC being measures of excitation in 
the nervous centers" (274, pp. 222- 
225). 


Leveling and Accentuation 


Under the “rule of leveling and 
accentuation” a refinement, implicit 
in the foregoing, was explicitly in- 
troduced to allow a wider encompass- 
ing of disparate data (257; 263, pp. 
31-35; 265, pp. 26-29; 274, pp. 225- 
230). This refinement consists in 
taking note of the initial contrast 
between the viewed object and its 
background. The following discussion 
points to this necessity. 

It was shown that, with simul- 
taneous auditory stimulation, visual 
irradiation of a white object against 
a black background intensifies, while 
that of a gray object against the same 
background diminishes (215). Here 
is an example of the kind of opposite- 
ly directed effects for which a uni- 
fied explanation is needed. The rule 
of leveling and accentuation that has 
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been proposed aims to provide 
where initial contrast is large, 
centuation of contrast occurs; wh : 
it is small, leveling or reduction ol 
contrast results. The reasoning be 
hind the rule is fairly direct and m 
be indicated by way of example, 
Supplementary excitation of th 
visual apparatus on accessory aud 
tory stimulation is not distributed 
uniformly over the visual field but is 
unevenly apportioned. Those region 
of the visual apparatus which already 
are to a considerable degree excited, 
“attract to themselves" the m j 
portion of the additional excitatiot 
the less strongly excited regions shar 
more modestly. In other words, the 
magnitude of supplemental excit 
tion, accessorily induced, is a fune 
tion of the excitation directly oc 
casioned by the primary stimulus 
Thus, on accessory visual stimulation 
loud tones should sound louder; weal 
tones should be little, if at all, af 
fected. In this way the rule of ac 
centuation is held explicable. 
But why a rule of accentuation @ 
leveling? Again an example will eluci; 
date. It is known that differenti 
thresholds for brightness discrimimi 
tion increase with brightness. If the 
is little initial brightness contra! 
between object and background, ài 


have been J2/J,, it is centrally no 
In+AE/I,+AE; AE is taken as t 
“brightness equivalent of the suppl 
mentary excitation” and 757 I1. Th 
leaves the resulting difference ? 
excitation about the same. The mag 
nitude of the latter ratio, now !e 
than the former, indicates, hov 
an impairment in sensing the b 
ness difference that obtains. 
brightness difference between ODI 
and background, initially contras" 
little, comes to be even harder to 
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serve with the result that a leveling 
tendency in brightness difference 
establishes itself (274, p. 225). 

On the the other hand, should the 
initial contrast be very large, acces- 
sorv increase of excitation accrues, in 
the main, to the brighter source as 
previously explained, so that the ef- 


fect of contrast comes in this in- 
stance to be seen as exaggerated 
(353, 446). 


Similar reasoning would account 
for the oppositely directed shifts in 
critical flicker frequency that are to 
be observed when accessory auditory 
stimulation is employed to reinforce 
flashes of low and high brightness 
respectively. ^ Critical flicker fre- 
quency is increased when the bright- 
ness of flash is initially high; it is 
decreased when initially low. Here 
the flash is thought of as an object 
presented against a less bright back- 
ground represented by the fading 
afterimage between flash phases (274, 
pp. 366-370). 


Intracentral Mediation 


An accumulation of experimental 
data (274), as well as the anatomical 
richness of intracentral connections, 
made natural the consideration of 
intracentral mediation as a factor 
in sensory interaction. Certain spe- 
cific research seemed also to suggest 
the intracentral mediational char- 
acter of accessory action. For exam- 
ple, in the course of registering the 
bioelectrical currents in the auditory 
region of the cerebral cortex of rab- 
bits, it was discovered that these 
Currents are modified in an interest- 
ing fashion, should the eye of the 
animal be exposed to regular flicker: 
the rhythm of electrical potential in 
the auditory region comes, after an 
interval, to correspond to that of the 
flicker (332). In addition, the facts 
of protopathic and epicritic sensitivi- 
ties were taken also to demonstrate 
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the presence of central inhibitory 
action of some afferential systems on 
others. Thus, the tactile and pro- 
prioceptive systems, taken as phylo- 
genetically of more recent origin, are 
seen as antagonistic in action to the 
systems giving rise to those sensations 
of gross temperature and diffuse pain 
which appear on disturbance of the 
inhibitory action of the former (265, 
p. 29; 327, p. 611; 371; 372, pp. 39- 
53; 375). 

The widespread influence of the 
Pavlovian propositions of cortical ac- 
tion (since 1950 official dogma) has 
also had its effect in increasing the 
plausibility of intracentral mediation 
of accessory action (274, pp. 178-180; 
338-340; 374, pp. 136-296). Thus, 
on the assumption of such mediation 
two possibilities present themselves: 
either excitation of a cortical center 
leads to excitation of another, modal- 
ly distinct, or it leads to depression 
of the latter. Accessory effects may, 
therefore, be reinforcing or antagonis- 
tic. 

It has previously been pointed out 
that central illumination raises pe- 
ripheral threshold, while peripheral 
stimulation decreases central sensi- 
tivity. The relationship is mutually 
antagonistic, provided central vision 
is not that of the achromat, inwhom 
an antagonistic influence is not ex- 
hibited due to cone deficiencies. 
Since central illumination of one eye, 
for example, affects adversely pe- 
ripheral sensitivity in the other, it 
has been concluded that such effects, 
as indicated above, must, therefore, 
be intracentrally mediated (391). 

Intracentral mediation of acces- 
sory action functions also to account 
for many of the reverse effects elic- 
ited upon increase in strength of 
the accessory stimulus (“rule of 
inversion”). The assumption is made 
that an initially weak accessory 
stimulus sets up two central proc- 
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esses: one positively inductive; the 
other negatively inductive. Upon 
increase in strength of the accessory 
stimulus, increases in the magnitude 
of the two oppositely signed inductive 
effects are given by two differently 
accelerated growth curves. Reversal 
of modal accessory effect is associ- 
ated, therefore, with the change of 
sign resulting from algebraic summa- 
tion of the two curves, once a cer- 
tain magnitude of the accessory stim- 
ulus has been exceeded (274, p. 181). 


Ionic Balance 


It has been pointed out that under 
anelectrotonic influence central sensi- 
tivity of the dark-adapted eye to 
green-blue is increased, to orange- 
red decreased, whereas under catelec- 
trotonic influence the reverse is ob- 
served. These changes parallel re- 
markably those resulting from the 
application of appropriate accessory 
stimuli and, hence, have been the 
subject of special study. 

Attempts to explain the results of 
electrotonic influence have generally 
been based on considerations of ionic 
concentration and balance (274, pp. 
281-286; 280, pp. 138-143; 415). In 
view of the known role of ionic cal- 
cium and potassium in the excitabil- 
ity of tissues, investigation turned 
first to them and later to ionic sodi- 
um and magnesium. 

Since in a solution potassium ions 
are univalent and those of calcium 
bivalent, the former migrate to the 
cathode more rapidly than the latter 
upon application of direct current to 
acircuit. The value of the ionic index 
[K]/[Ca], which measures the rela- 
tive concentration of ionic potassium 
and calcium, increases, therefore, at 
the cathode; it decreases, of course, 
at the anode due to increasing con- 
centration of ionic calcium there. 

If ionic factors are involved in the 
reported modifications of central 
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sensitivity, it was reasoned that 
changes in the relative concentration 
of ions should be accompanied by 
changes in the various central sensi- 
tivities. Through iontophoresis" it 
was possible to introduce ionic cal- 
cium and potassium into the eyeball 
in order to test the surmise. Predic- 
tions were fulfilled (274, p. 284; 
283): ionic calcium was found to in- 
crease sensitivity to green but to 
decrease it to orange-red, whereas 
ionic potassium was found to in- 
crease sensitivity to orange-red but 
to decrease it to green. Termination 
of the iontophoretic process reversed 
the respective effects (262, 281, 282, | 
284). 

Further experimentation showed 
that ionic sodium was similar in ac- 
tion to ionic potassium, while ionic 
magnesium was similar in action to 
ionic calcium (260, 376, 377). Hence, 
decrease in value of the ionic index 
[K]--[Na]/[Ca]--[Mg] is associated 
with increase in sensitivity of the 
green- and blue-sensing apparatus 
of the eye, while increase in value of 
the ionic index is tied to increase 1m 
sensitivity of the red-sensing appara- 
tus. ` 

Since ionic sodium and potassium - 
are known to favor the concentra- 
tion of acetylcholine in tissues, where- 
as ionic calcium and magnesium are 
known to lead to its reduction, it 5 
considered very likely that various 
concentrations of acetylcholine exhib- 
it respective optimal effects for the 
various color-sensing apparatuses 0 
the eye (274, p. 284). In the last 


1 The introduction of ions into the body by 
means of electric current. P 

3 Through iontophoresis it was pe 1 
possible to neutralize certain accessory eff of Y. 
on central sensitivity. Thus, iontophoresis, 
sodium, which reduces central sensitiv! 
to green (520 my), neutralizes the a ndet 
effect of auditory stimulation which, ut ity 
ordinary conditions, heightens this sensitiv! 
(275, p. 146). 


= 
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analysis, then, central sensitivities 
are seen as dependent on ionically 
influenced acetylcholine concentra- 
tion (275, pp. 143-145). 


The Autonomic Nervous System 


A variety of accessory stimuli such 
as tones, noise, the taste of sweetness, 
the odor of bergamot oil, geraniol, 
and camphor has been reported to 
produce similar differential effects 
on retinal sensitivity to green-blue 
and orange-red lights: heightening 
sensitivity to the former, decreasing 
sensitivity to the latter (228, 234, 
251). This circumstance led re- 
searchers to look for a common factor 
behind this discerned uniformity of 
differential action. They noted that 
application of the above stimuli was 
accompanied by a quickening of the 
pulse, and it was conjectured that the 
common factor might well be the 
autonomic nervous system, in view 
of the fact also that the odor of thy- 
mol, which reverses the differential 
effects cited above, was found to re- 
duce pulse rate (8;9; 160; 172, pp. 81, 
88; 242; 252; 253; 256, p. 49; 265, p. 
31; 399, pp. 412-419). 

This conjecture among others led 
to experimentation which disclosed 
that the color sensitivity of the eye 
was changed in the reciprocal pattern 
mentioned above on direct instilla- 
tion of adrenalin into the eye. Fur- 
ther research along these lines demon- 


“ Tonic calcium and magnesium are known 
to activize cholinesterase which through hy- 
drolysis reduces acetylcholine content; ionic 
sodium and potassium, on the other hand, are 
known to suppress cholinesterase, thereby 
Increasing acetylcholine content. 

Interference with the acetylcholine chain 
of reactions, as through homatropinization 
of the eye, stabilizes central sensitivities in 
Spite of iontophoresis. Hence, strictly speak- 
ing, not acetylcholine, but processes involving 
it in an essential way are to be associated 
With modifications of central sensitivities 
(354, 377). 
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strated that sympathomimetic sub- 
stances such as cordiamin, phena- 
mine, sympathol, and ephedrine 
heighten sensitivity to green light 
for the dark-adapted eye, but lower it 
to red light. On the other hand, para- 
sympathomimetic substances such as 
carbocholin, veronal, the berries of 
Schizandra Chinensis, and pilocarpine 
usually bring on reverse effects: sen- 
sitivity to red light is heightened for 
the dark-adapted eye, while that to 
green light is lowered. Ephedrine is 
reported to heighten also gustatory, 
olfactory, and auditory sensitivities 
(48, pp. 255-258; 112; 114; 115; 147; 
148; 153; 252; 253; 274, pp. 276-278; 
279; 303; 399, pp. 401—407). 

Confirmation of the autonomic 
mediation of a number of accessory 
effects has been sought also in data 
from experiments involving electri- 
cal application to the eyeball. For 
example, it will be remembered that 
if direct current of weak intensity is 
passed through the dark-adapted eye 
with anodal contact, sensitivity to 
green-blue light is heightened with 
corresponding diminution to orange- 
red light—the characteristic antithe- 
sis. The effects on sensitivities are 
reversed with cathodal contact (268; 
274, p. 190; 280; 283; 411). 

Now, it was pointed out that the 
relative concentration of calcium ions 
is increased in the vicinity of the 
anode during electrical action. Since 
increase in relative concentration of 
calcium ions, in a very great number 
of instances, is understood to induce 
effects similar to those of sympathetic 
excitation, the mechanism of the 
above accessory effects is held to be 
probably autonomic (262; 265, p. 
33; 274, pp. 285-286; 282; 284). 

Therefore, auditory stimulation, 
the odor of bergamot oil and geraniol, 
and the taste of sweetness are said 
to achieve their accessory effects 
through influence on the sympathetic 
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division of the autonomic nervous 
system with differential effects on 
the color-sensing apparatus of the 
eye; adverse effects are induced where 
parasympathetic excitation would 
instead ordinarily facilitate. 

Corroboration of autonomic ac- 
tion has been sought in a variety of 
experiments. For example, back- 
ward tilting of the head for a period 
of time is claimed to bring about 
conditions favoring parasympathetic 
excitation. One should then expect a 
reduction of sensitivity to green 
light—an expectation which was con- 
firmed (265, p. 32; 423). 

It should be noted that the differ- 
ential action of the two divisions of 
the autonomic nervous system on 
the green- and red-sensing apparatus 
is also consonant with the observed 
reciprocal action of green and red 
light on a number of physiological 
functions. For example, the internal 
ocular pressure of one eye is increased 
or decreased according as the other 
eye is illuminated with red or green 
light respectively (476, 477). A simi- 
lar reciprocity of action has also been 
observed with hyperventilation which 
decreases sensitivity to green light 
while increasing it to red (296). 

Certain changes in peripheral 
vision are likewise held ascribable to 
modifications of the autonomic state 
(8, 9). Thus, vestibular stimulation, 
which is said to favor parasympathe- 
tic excitation, was noted to worsen 
peripheral sensitivity (14). Back- 
ward tilting of the head is also re- 
ported to decrease peripheral sensi- 
tivity, which fact accords with the 
increase of parasympathetic excita- 
tion held attributable to the tilted 
position of the head. Changes in 
peripheral sensitivity have also been 
noted with alteration of relative 
ionic concentrations, electrically in- 
duced—a fact which again is taken 
to relate modification in peripheral 
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sensitivity to autonomic 
(283). 

Further confirmation of the 
nomic mediation of ace 
was sought in experiments im 
the cerebellum. For example, 
shown that subjection of the ee 
bellum to electromagnetic wave 
ultra high frequency definitel 
tered the level of sensitivi 
peripheral vision (334). Sine 
cording to Orbeli, the eb 
must be regarded as one of the 
regulators of sympathetic 
activity, the results of c 
such as the above were ta 
tending to demonstrate the 
dependence of the visual ap; 
on the autonomic nervous sys 
(372, 383).'5 t 

While a wide variety of da 
accounted for on the basis of a 
nomic mediation of accessory ac 
Kravkov points out that the ini 
drop in peripheral visual sensi 
on accessory auditory stimulati 
an experimental datum which 
not harmonize with all of the 
going. Auditory stimuli appe 
be sympathotropic as to effe 
pulse rate is stepped up, the sens 
tivity of the green-sensing apparatu 
of the eye is heightened, the reti 
blood vessels are constricted, et 
But peripheral sensitivity appears” 
be sympathotropically affected it 
much as it increases under anelec 
tonic conditions and decreases 
vagotropic stimuli such as are 
vided by the backward tilting © 
head. One should, therefore, e 
a positive effect on periphe 

% Orbeli's theory of the "'adaptive-tri 
regulatory action of the sympathetic n 
system on all tissues of the body, inc 
the central nervous system and the 
musculature, has had a considerable | 
on research in the field of sensory 1n 
(55; 69; 95; 123; 160; 162, p. 77; I 
193; 256, p. 46; 334; 336; 337; 368-3 
468). 
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tivity through accessory auditory 
stimulation. The reverse, however, 
takes place. To account for this, 
Kravkov hypothesizes the existence 
of intracentral connections whose in- 
fluence in an ad hec fashion domi- 
nates antagonistically over those 
influences proceeding through the 
medium of the autonomic system 
(267, p. 81). 


The Green Receptors 


The fact that the application of 
either sympatho- or vagotropic ac- 
cessory stimuli leaves unaffected 
sensitivity both to extreme spectral 
red and violet and to colors in the 
region of 570 my has led to the assign- 
ment of an especial role to the green- 
sensing apparatus of the eye. It is 
hypothesized that only if the green 
receptors are stimulated is accessory 
heightening or lowering of color sensi- 
tivity evocable (254; 256, p. 49; 266; 
274, pp. 327-331; 352). 

Throughout the spectral range of 
excitability of the green receptors 
(from 430 mp to 680 mu approxi- 
mately) excitation of the green- 
sensing apparatus appears every- 
where accompanied by accessory ef- 
fects except in the neighborhood of 
570 mu. Reference to the three exci- 
tation curves of the three basic color 
receptors (b, g, r) shows that the ordi- 
nates Y, and Y, are about equal in 
the neighborhood of 570 my, Y being 
taken as equal to degree of excitation. 
Since excitation of the green receptors 
is held to induce a positive accessory 
effect, the failure of accessory effect 
in the neighborhood of 570 mp is 
ascribed to an initiated negative ac- 
cessory effect of equal magnitude, in- 
duced by a simultaneous excitation 
of the red receptors. 

Since magnitude of unsummated 
accessory effect is regarded as a 
direct function of degree of excitation 
of the respective color-sensing ap- 
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paratus involved, where Y,» Ya the 
algebraic sum of the associated posi- 
tive and negative accessory effects 
yields a residual accessory effect 
which is positive; where Y, < Y,, the 
residual accessory effect is negative; 
where Y,- Y, accessory effect is 
"rs (430 < my «680 approximate- 
y). 

Excitation of the blue receptors is 
thought to enhance accessory effect 
on the condition that mu »430 ap- 
proximately. Thus, in the neigh- 
borhood of 520 ma where F= Y, 
approximately, the inhibiting action 
of the red receptors is canceled out by 
the sensibilizing action of the blue 
receptors. This produces, according- 
ly, a positive accessory effect derived 
solely from stimulation of the green 
receptors, one which, as it turns out, 
is maximal. 

On making the inhibitory and sen- 
sibilizing action of the red and blue 
receptors respectively explicitly de- 
pendent upon excitation of the green- 
sensing apparatus, the failure to elicit 
accessory effects at the spectral ex- 
tremes (ma «430; mu» 680 approxi- 
mately) is made understandable. 

To test the hypothesis that ac- 
cessory effects on color sensitivity in 
trichromats are not observed without 
excitation of the green-sensing ap- 
paratus, an experiment was per- 
formed to see whether through cen- 
tral mediation accessory effects at 
the spectral extremes could not be 
elicited in one eye upon illumination 
of the other with green light. The 
results were affirmative, thereby con- 
firming the special role assigned to 
the green receptors (253, 254, 352). 
Modification of Primary Conditions 

Accessory stimulation is held to 
produce its effect in many instances 
in ways not involving considerations 
of excitation or excitability of the 
primary sensory system. These refer 
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to accessorily induced modifications 
of the physical conditions under 
which primary stimulation is ap- 
plied. For example, the pupillary 
reflex may respond to an auditory 
stimulus. Here the quick contraction 
of the pupil with its slow subsequent 
relaxation would be expected to 
modify some aspects of the visual 
function. Since the amount of light 
entering the eye is affected by pupil- 
lary width, upon nonemployment of 
an artificial pupil apparent changes in 
visual threshold which follow upon 
auditory stimulation may be ascrib- 
able not to changes in excitation or 
excitability of the visual afferent 
system, but rather to changes, in- 
duced by accessory stimulation, in 
the physical conditions. Since the 
pupillary reflex is not without effect 
on accommodation, visual acuity 
also may undergo modification owing 
to the accessorily induced changes in 
the physical conditions accompany- 
ing primary stimulus action. 

Similarly, pain, vestibular stimu- 
lation, etc. modify the pupillary reflex 
and thereby may give rise to simu- 
lated accessory effects, As another 
instance, tactile stimulation around 
the ear appears to affect auditory 
threshold; but again, the result of 
such stimulation might more aptly 
be ascribed to reflex changes in the 
small muscles controlling the tension 
of the bony system of the middle ear 
than to changes in the auditory af- 
ferent system (267, pp. 87-88; 274, 
pp. 86-88). 


Conditioning 

The conditionability of sensory 
responses adds immensely to the 
range of sensory interaction and is 
considered a factor to be reckoned 
with though the accessory effect is 
not native but acquired. Work in 
the Soviet Union on conditioned 
"sensory reflexes" dates back to 1936 
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when Kravkov, Kekcheev, Bogo- 
slovkif, and Dolin independently 
demonstrated the conditionability 
of changes in visual responses. Thus, 
it was early shown possible to condi- 
tion changes in electrical sensitivity of 
the eye, critical flicker frequency, 
peripheral sensitivity, etc. to such 
indifferent stimuli as ticking, a tone, 
and elapsed time of eye exposure to 
the dark (16-18; 22-25; 50-52; 57; 
79-84; 124; 169-171; 183; 202; 233, 
p. 474; 256, p. 53; 259; 263, pp. 41- 
437265, pp. 36-39; 267, p. 35; 274, 
p. 192; 309; 339; 381; 396; 417; 418). 

To illustrate some of this work: 
Pshonik managed to induce changes 
in thermal sensation in response to 
accessory auditory stimulation by 
making of the latter a conditioned 
stimulus (57, 381). Dobriakova con- 
ditioned increase of gustatory sensi- 
tivity to the ticking of a metronome 
after having associated the latter 
with light—a stimulus which ordi- 
narily induces a positive accessory 
effect on gustatory sensitivity. In 
addition, it was found that symbols, 
denoting stimuli which heighten gus- 
tatory sensitivity, make manifest 
like effects (71, 72, 171). A general 
heightening of various visual and 
auditory sensitivities has been ob- 
Served to ensue through mere pres- 
ence of a subject in the experimental 
room. Frequent prior presence in the 
room while in a state of intense at- 
tention is here said to condition modi- 
fication of visual and auditory sensi- 
tivities to the sight of the experimen- 
tal room—the state of attention being 
thought of as in the role of “uncon- 
ditioned stimulus" (79; 80; 267, pP- 
85, 111; 272; 274, pp. 192-194).!* 

**Concentrated attention" has been as- 
signed an important role in the sensibilization 
of the receptors. ‘Attentive practice" am 
expectation also lead to sensibilization, the 
former being held capable of doing so on fr 
permanent basis (48, p. 209; 149; 172, p. 92i 
274, pp. 230-231 ; 396; 408; 416; 460). 
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CONCLUDING REMARKS 


Soviet research on sensory interac- 
tion has amassed over the past 24 
years a fund of data that, in spite of 
a deserved skepticism, cannot but 
strike one for its persistent consist- 
ency and *''logical," though novel, 
character. This research appears to 
demonstrate that all modalities un- 
dergo various modifications of sen- 
sory response on appropriate appli- 
cation of an accessory stimulus and 
that, where the primary stimulus is 
visual, the resulting modifications 
conform to striking patterns. 

Basic to one of these patterns is the 
fundamental role assigned to the 
green-sensing apparatus whose exci- 
tation is a requisite for the media- 
bility of accessory effects through 
simultaneous excitation of the non- 
green receptors. Of the latter there 
are those which upon excitation are 
reinforcing for accessory effects, while 
others are depressive. Thus, where 
sensitivity to blue-green, for example, 
is heightened, that to orange-red is 
lowered; and vice versa. 

The antagonistic relationship be- 
tween color receptors is traced to in- 
versely related ionic concentrations 
induced through autonomic nervous 
action brought into accessory play. 
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Thus, where ionic calcium predomi- 
nates, sensitivity to green is height- 
ened, to orange-red lowered; where 
ionic potassium predominates, re- 
verse sensitivities obtain. 

Other patterns of accessory effect 
and of reciprocal accessory action 
have been intensively investigated 
with characteristic results. However 
critical one may be of the Soviet 
studies, the cumulative results ob- 
tained with their apparent inner con- 
sistencies suggest, therefore, the 
worthwhileness of a re-examination 
of the whole problem of sensory inter- 
action. Recent restrictions on types 
of psychological research in the 
U.S.S.R. in favor of ''Pavlovian" 
methodologies and conceptions sug- 
gest that non-Soviet researchers are 
the ones to undertake the job (60; 
201; 272; 338-341; 434, pp. 69-70; 
473; 498-503; 505).!7 


17 Signs point to a possible relaxation of 
dogmatic prescription of allowable method- 
ologies and conceptions. Both Ivanov-Smo- 
lenskif and Bykov, the official “Lysenkos” 
of Soviet psychology, physiology, and related 
disciplines, who in their ruthless advocacy of 
a raw Pavlovism have exercised great power 
in these fields since 1950, have recently been 
censured in Pravda along with Lysenko for 
suppressing the views of those holding con- 
ceptions "deviant" from theirs (440). 
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This paper is an attempt to sum- 
marize the interrelationships of two 
seemingly separate disciplines that 
deal with the study of man. Biogra- 
phy, in all its diverse forms, attempts 
to describe human lives, and this is 
also one of the purposes of the science 
of psychology. Inevitably, these two 
fields have drawn upon each other in 
a manner beneficial to both, though, 
as will be seen, each could make still 
better use of the other than it does at 
present. 

From the earliest times, the best of 
the writers of biography have striven 
to describe not only the overt actions 
and recorded facts of their subjects’ 
lives, but also their personalities. 
The more perceptive writers have 
also realized that there is a connec- 
tion between a man and his deeds, 
that an understanding of his person- 
ality helps explain his accomplish- 
ments, and that his accomplishments 
throw light upon his personality. 
This insight is implicit, for example, 
in these famous lines from Plutarch’s 
life of Alexander the Great: 

My design is not to write histories, but 
lives. And the most glorious exploits do not 
always furnish us with the clearest discoveries 
of virtue or vice in men; sometimes a matter 
of less moment, an expression or a jest, in- 
forms us better of their characters and in- 
clinations, than the most famous sieges, the 
greatest armaments, or the bloodiest battles 
whatsoever (69, p. 801). 


Thus, the best biographers, like the 
best novelists, have always been stu- 
dents of human psychology, and the 
development of psychological science 
has tended to confirm insights long 
ago achieved. The novelist E. M. 
Forster has written: ‘Psychology is 


not new, but it has newly risen to the 
surface. Shakespeare was subcon- 
sciously aware of the subconscious” 
(35, p. 9). It is hardly necessary to 
belabor ‘this point, which has been 
made many times. C. K. Trueblood, 
for example, has elaborately described 
the sound psychological principles de- 
veloped by the French critic-biogra- 
pher Sainte-Beuve (81). Or, to cite 
an American illustration, the follow- 
ing statements taken from James 
Russell Lowell's sketch of Thoreau 
show obvious psychological under- 
standing: 

He condemns a world, the hollowness of 
whose satisfactions he had never had the 
means of testing. 

Those who have most loudly advertised 
their passion for seclusion and their intimacy 
with nature... have been mostly senti- 
mentalists, unreal men, misanthropes on the 
spindle side, solacing an uneasy suspicion 
of themselves by professing contempt for their 
pco had not a healthy mind .... His 
whole life was a search for the doctor (87, pp. 
200, 205, 203). 


But the tremendous development 
of psychology as an organized dis- 
cipline since the late nineteenth cen- 
tury has naturally had an impact 
upon the writing of biography. By 
1896 Havelock Ellis was calling biog- 
raphy a branch of “applied psychol- 
ogy” and bewailing the fact that 
most life writers knew nothing of the 
work of men like Wundt, G. Stanley 
Hall, Münsterberg, and Jastrow (32, 
pp. 96-97). The publication of 


1 The average biographer, Ellis wrote, was 
producing “a figure that is smooth, decorous, 
conventional, bien coiffé, above all, closely cut 
off below the bust . . . ” (32, p. 98). This was 
certainly an accurate picture of most (but not 
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Freud's The Interpretation of Dreams 
in 1900 and more especially of his 
Leonardo da Vinci a decade later 
opened up the possibility of psycho- 
analyzing historical figures, and of re- 
interpreting great masses of human 
history as well. 

At first, the influence of the radical 
Freudian thinking spread slowly. 
With psychologists themselves hesi- 
tant or openly scoffing, biographers, if 
they were even aware of psychoanaly- 
sis, did not rush to adopt the tech- 
nique. The earliest psychoanalytic 
biographies were written by European 
psychologists, followers of Freud, and 
were published chiefly in Freud's 
magazine Imago, and in other Ger- 
man professional journals (30). A 
notable exception to this was a study 
of Martin Luther by the American 
historian and Reformation scholar 
Preserved Smith, which appeared in 
1913 and explained Luther's career in 
terms of the Oedipus complex (78). 
But Smith neither pursued the 
method further nor transferred his 
conclusions about Luther to his other 
writings about the Reformation. Few 
historians, therefore, were even aware 
of his effort, and thus not stimulated 
to copy it. 

The development of the “new 
history under the leadership of James 
Harvey Robinson after 1910, with its 
emphasis on broadening the interests 
and methods of history, did not neg- 
lect psychology. Robinson, address- 
ing the American Historical Associa- 
tion in 1910, called for a general ex- 
pansion of history into new fields. 
“History,” he said, “must recognize 
that it is but one of several ways of 
studying mankind." Robinson called 
attention to Tarde's work in the field 


————————————— 


of all) late Victorian biographies. Ellis' in- 
dictment is an interesting precursor of Lytton 
Strachey's famous assault on the type in the 
preface of Eminent Victorians written in 1918. 
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of imitation, and warned his ¢ 
leagues that "the relations of ou 
reason to the more primitive instine 
which we inherit from our animal an- 
cestors . . . will never be understoo 
without social psychology" (72, 
74, 93). But while the “new” histor 
was readily accepted by histo 
historical writers and biographers did. 
not hasten to apply psychologue 
tools to their work. 
A great change in this situatie 
came about after World War I. Th 
experience of doctors dealing vit 
thousands of cases of war n 
which were obviously not the € 
ventional ‘‘shell shock," did much 
strengthen the Freudian th 
The appearance in English of Fi 
General Introduction to Psychoa: 
in 1920, and the popularizations 
psychoanalytic theory, spread t 
doctrine into nonprofessional ] 
(64, p. 329). Suddenly biographer 
began to see the possibilities—the 
sights into hidden motives, the e: 
from the limitations of "factual" 
ography, the sensationalism inhe! 
in an approach that emphasized 
with its healthy effect on sales. 4$ 
early as 1919 Harry Elmer Barnes, 
disciple of Robinson, had gone of 
record in support of the use of ps} 
choanalysis in biography (9),? and b 
the next year the rush was on. — - 
It would be of little value to 8! 
tempt to list all the biographies put 
lished in the twenties that mac 
conscious use of Freud and of hi 


? Another early example of the influe! 
Freud on biography can be found in 
Maynard Keynes's Economic Consequ 
the Peace (1919). In discussing Presi 
son's attitude toward the Versailles 
Keynes wrote: "In the language of 
psychology, to suggest to the Presiden! 
the Treaty was an abandonment of his 
fessions was to touch on the raw a ETC 
complex. It was a subject intolerable 
cuss, and every subconscious instinct 
to defeat its further exploration" (52 


critics within the ical 
movement like Jung and Adler, but 
a few of the most popular should be 
mentioned. Katharine Anthony's 
Margaret Fuller (7) was one of the 
earliest. R. V. Harlow's Samuel 
Adams (44), Harvey O'Higgin's The 
American Mind in Action (66), 
Joseph Wood Krutch's Edgar Allen 
Poe (53), Clement Wood's Amy 
Lowell (86), G. W. Johnson's Ran- 
dolph of Roanoke (48), and L. P. 
Clark's Abraham Lincoln (21) are 
among the best known. 

The appearance of such books as 
these touched off a debate among 
biographers, historians, literary crit- 
ics, and others which rivaled the 
general contest between the advo- 
cates of psychoanalysis and the con- 
servative forces of society. Naturally 
enough, the practitioners of the tech- 
nique were loud in their own defense. 
Leon Pierce Clark, the biographer of 
Lincoln and himself a practicing psy- 
chiatrist, argued that the ordinary 
biographer “has not fully exhausted 
the possibilities of his subject, be- 
cause of inadequate psychological 
training" (19) and cited "such com- 
petent [historical] authorities” as 
Robinson, Barnes, and 
Smith to prove that leading histori- 
ans recognized the value of psycho- 
analysis (20). 

The Freudian biographers tended 
to place great stress upon the close 
connection between a writer and his 
literary productions. They defended 
wholeheartedly their use of imagina- 
tive writings as source material for an 
understanding of the author’s per- 
sonality. Discussing Poe, Joseph 
Wood Krutch declared: “The forces 
which wrecked his life were those 
which wrote his books." The man 
ard his works, Krutch argued, were 
"nearly identical" (53, pP. 18-19). 
Clement Wood, himself a poet, wrote 
in his Amy Lowell: “A poet’s work 


poetry a portrait of the author...” 


ing the inner life than to make the 
infinitely greater mistake of ignoring 
its existence and its import” (63, 
p. 5). 

Perhaps the supreme tribute to 
psychoanalytical biography was paid 
by the Marxian historian Matthew 
Josephson: 

The dialectics of subconscious motive un- 
derlying the conscious to be as fruitful 


100). 


But against such opinions as these 
can be placed a mass of contradictory 
statements, for many biographers 
have attacked the method. First may 
be mentioned those who seemed to 
have been motivated chiefly by prud- 
ery. It is unnecessary to do more 
than quote a few lines to show the 
nature of their argument: 

The worst foe of biography that has yet ap- 
is the disciple of Freud, who crawls 

like a snail over all that is comely in life and 
‘Apart from all their sex nonsense and their 

ion of their characters’ 


psychology, [psychoanalytica] Lar aa 


authors quite realized who and what their 
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Actually, very few critics have re- 
sorted to such arguments as these. 
But many have based their opposi- 
tion on a denial of Freudian princi- 
ples, particularly his insistence upon 
the dominant importance of sex. 
George Bernard Shaw once wrote to 
Frank Harris, who was preparing a 
life of the playwright: 

O Biographer, get it clear in your mind that 
you can learn nothing about your sitter . . . 
from a mere record of his gallantries .... If 
I were to tell you every such adventure that 
I have enjoyed, you would be none the wiser 
as to my personal, nor even as to my sexual 
history (quoted in 15, p. 755). 


More specifically, Emil Ludwig wrote 
in Dr. Freud, which is a bitter attack 
upon psychoanalytical biography: 
“When Freud inspects a figure out of 
history he resembles a physician who 
instead of examining his patient’s 
whole body looks at a single organ, 
usually the genitals, and upon it 
founds his diagnosis” (58, p. 177). 

Less prejudiced critics have 
claimed that whatever its value as a 
therapeutic tool, the use of psycho- 
analysis serves no purpose in biogra- 
phy. Historian Jacques Barzun, for 
example, has argued that the typical 
Freudian biography merely substi- 
tutes a new jargon for old expres- 
sions, without really throwing light 
on the subject’s personality (11, pp. 
277-278). Similarly the biographer 
Phillip Guedalla once wrote of “the 
familiar cake iced thinly with a little 
mental slang imported from Vienna” 
(41, p. 115), and Hesketh Pearson 
has made essentially the same point 
(68, pp. 313-315). 

A powerful complaint raised by the 
foes of psychoanalysis in life histories 
has been the lack of evidence upon 
which to base the study of a per- 


3 Psychological biography, Guedalla wrote 
on another occasion, is "one half of something 
we all knew before, and... the other half of 
something which is not so” (42, p. 928). 
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sonality no longer living. Dui 
Malone, editor of the Dictiona 
American Biography, has put it init 
simplest terms. “It is a pity," he 
wrote in his essay “Biography 
History," "that no more auth 
information is available abou 
the crucial early years of nurt 
(59, p. 143). André Mauro 
the same point in his Aspects of 
Biography. “Who is keeping a re 
of Bertrand Russell's dreams, so: 
the Freudian biographers may 
pret them at a later date?" he 
(60, p. 90). The anthropo 
Alexander  Goldenwiser, ce! 
sympathetic to psychoanalyti 
ory in itself, has written: 


It is, of course, tempting to apply. 
sights of psychoanalysis to a speculative in- 
terpretation of historical characters. 
But the procedure is hazardous in the ex 
and, more important still, it stands in G 
diction to the avowed professions of the 
choanalysts themselves (39, p. 416). 


Finally, the most perceptive 
have tended to concentrate the 
tack on the manner in which psyc 
analytical biographers have used 
Freudian constructs of subli 
and reaction formation. Harry 
Barnes, to cite a single objectio! 
instance, was able to explain Ales 
der Hamilton’s “authoritarian” 
sonality as resulting from the fal 
that he was never subject to the c0! 
trol of a stern father. But in the s 
article Barnes accounted for Je 
son's anti-authority complex by € 
plaining that his father died when 
was only fourteen—before his “fathe 
image” had had time to mature! ^4 
is significant," Barnes commen 
learnedly, “that Jefferson’s anti 
to his father was so infantile 
deep-seated that it was sc 
raised to consciousness." 
Jefferson's references to his. 
were “reverential and awe-insp: 
which to Barnes was "obvious" prod 
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that he hated him (10, pp. 28-37). It 
was this kind of analysis which led 
Howard Mumford Jones to complain: 
“So long as any symbol can be trans- 
lated into any other symbol from the 
subject's past, so long the psycho- 
analyst may roam at will" (49, p. 
114). Bernard De Voto, never a man 
to compromise a strongly held opin- 
ion, has even said: ‘Psycho-analysis 
has no value whatever as a method of 
arriving at facts in biography. No 
psycho-analytical biography yet writ- 
ten can be taken seriously—as fact” 
(26, p. 185). To which may be added 
the satire of Stephen Vincent Benét: 
And as for the poor devil's really loving his 
wife—well, I know that every authentic ut- 
terance of his on the subject says he did, and 
there are those dedications of his books, that 
sound fairly sincere. But honestly, he simply 
hated her all the time .... Don't you realize 
that his unconscious portrait of her as the de- 
generate scrubwoman with leprosy in chapter 
four of “Wedded and Wooed" represents his 
real feelings toward her? (12, pp. 169-170). 
It is clear that some of the argu- 
ments against psychoanalytical biog- 
raphy have had great force. The ini- 
tial enthusiasm of the twenties 
slackened off rapidly in the thirties, 
and while such studies still occasion- 
ally appear, they are relatively rare 
today. In a recent survey of British 
biography by V. S. Pritchett, only 
one book (a life of Emile Zola by 
Angus Wilson) was listed as utilizing 
“the Freudian approach,” and Amer- 
ican biography has followed the same 
trend (70). Examination of the 
Psychological Abstracts reveals a simi- 
lar decline of psychoanalytical stud- 
ies of historical personalities by psy- 
chologists. Indeed, Franklin Fearing 
pointed out as early as 1927 that few 
academic psychologists had ever ac- 
ott espoused the method (33, p. 
'The rejection of psychoanalytical 
techniques by most biographers has 
not meant a corresponding disavowal 
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of all psychology. Some have argued 
that is merely “human 
nature” and that no formal study of 
the subject is necessary,‘ but the 
number who have paid tribute to the 
importance of the subject is far 
larger. As early as 1912 the writer 
Allen Johnson, while protesting 
against merely speculative studies of 
personality, called for greater aware- 
ness of the complexities of human 
character. The biographer should 
pay more heed to the early life of his 
hero instead of “joyously trans- 
port [ing] the inchoate statesman into 
the national arena, where he will 
thenceforth shape the course of em- 
pire" (47, p. 394). Johnson also sug- 
gested that an intensive study of 
great men might throw light on the 
problem of leadership, an idea that 
modern psychology is certainly in- 
vestigating (47, p. 403). 

Later writers have touched upon 
related points. An English critic, 
after denouncing the way psycho- 
analysis has been “ridiculously 
abused” by biographers, has praised 
the “weapon of psychology” which 
provides the modern biographer with 
“a far better surgical dissecting in- 
strument than his predecessor had” 
(27, p. 128). “The new psychology,” 
this critic writes, ‘‘has infused the art 
with new blood” (28, p. 19). An 
American student of modern biogra- 
phy has put it this way: “I do believe 
that somewhere between the ex- 
tremes of the conventional method 
and the purely psychological there 
lies a mean,” while Claude Fuess, an 
experienced practitioner of the art, 


‘Hesketh Pearson, for example, has denied 
all formal knowledge of psychology. "Dr. 
Freud... is to me but a name." To persons 
who have claimed that his biographies show 

ological insights, his answer is that this 
is “simply due to my own common sense and 
intuition” (68, pp. 313, 315). (See also 24, 
p. 4.) 
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after examining the dangers of peer- 
ing into the unconscious, concludes: 
“Yet with all these reservations, I am 
sure that the chief contribution made 
in recent years to the art of biography 
is the application of psychology to 
the interpretation of personality" 
(36, p. 351). 

Most such statements, while well 
intentioned, are extremely vague. 
What these writers seem to be getting 
at in praising psychology as a tool of 
biography is difficult to tie down. 
But an effective summary has been 
offered by the historian Edward M. 
Hulme: Psychology has led to “‘the 
wholesome skepticism with which we 
have come to regard many of the al- 
leged facts of history." It has made 
it possible to deal more satisfactorily 
with the conflicting statements of eye 
witnesses, with autobiographical 
writings and other personal docu- 
ments, with human habits and mo- 
tives—in short, "it helps to account 
for many facts, and also to trace the 
series of mental operations that come 
between the facts and the written 
record of their observation” (45, pp. 
175-178). 


Yet psychology can offer biography 
something more than this, though bi- 
ographers have not availed them- 
selves of it. In a few cases, where the 
evidence is plentiful and the condi- 
tions clear, psychoanalysis can cer- 
tainly be used with justice. In the 
case of George III, who was clearly 
insane, and whose madness was thor- 
oughly examined by contemporaries, 
the records are adequate, and a study 
such as Guttmacher's America's Last 
King is perfectly acceptable to the 
most conservative historians (43). 
On the other hand, it may be argued 
that where the ordinary evidence is 
extremely skimpy, as in the case of 
Leonardo, a psychoanalytic interpre- 
tation like Freud's may be quite use- 


ful. When conventional evidence is 
missing, the biographer must guess in 
any case, and an intelligent guess 
based on psychoanalysis, provided it 
is clearly labeled a guess (as Freud's 
was), can surely be defended. 

Even in the vast majority of biog- 
raphies, which fall between the two 
extremes, no one should object to the 
use of Freudian techniques if they are 
explicitly described as speculations, 
and if known facts are not twisted or 
ignored in order to bring the subject 
into a preconceived pattern. Much 
of the criticism that has been aimed 
at psychoanalytical biography has 
arisen from the dogmatic way in 
which the authors of such works have 
stated their cases. As Bernard De 
Voto has said: “Must is the mecha- 
nism of psycho-analytical biography. 
... Biography proper is not con- 
cerned with the must but only with 
the did” (26, p. 188). If psychoana- 
lytical biographers would confess to 
the speculative nature of their con- 
clusions, their critics would be less 
bitter. 

Aside from psychoanalysis, biogra- 
phy could benefit from other psycho- 
logical techniques for the study o 
personality, particularly from the 
scientific analysis of personal docu- 
ments. This is a field that has been 
badly neglected. When the Social 
Science Research Council conducted 
its inquiry into the use of personal 
documents in psychology; history: 
anthropology, and sociology, its his- 
torian, Louis Gottschalk, begged the 
question and presented a paper W ic 
confined itself almost wholly to an 
analysis of general historical met i 
“For the historian the method 0 
dealing with human documents is 4 
of historical method," he wrote K 
p. 6), ignoring the host of ideas adap 
able to historical study in S: 7 
Allport's “The Use of Personal Doc 
ments in Psychological Science pu 
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lished in 1942, three years before his 
own work appeared. Allport's cogent 
analysis should be inspected by every 
biographer, both for its summary of 
the work of students of personality 
and for its balancing of the pros and 
cons of the usefulness and reliability 
of personal materials (5). 

Certainly there are many ways in 
which biographers could apply knowl- 
edge gathered by workers in psy- 
chology. Recent advances in the 
study of graphology, for example, 
ought to be considered carefully. 
While it would be foolish to interpret 
a historical figure only in terms of his 
handwriting, the rich ore that this 
mine of expressive movement can 
yield should be dug out, refined, and 
forged into a handy biographical 
tool, for, as Allport and Vernon have 
pointed out, handwriting is really 
"brain writing” (6, p. 187). Indeed, 
the entire field of projective and ex- 
pressive techniques, ignored by biog- 
raphers, should be searched for 
methods that might be applicable to 
the study of persons no longer living. 

Unfortunately, the most valuable 
tools, like the Rorschach and the 
TAT, are of no use in historical in- 
vestigations, but there are others 
that might well be employed. Certain 
forms of semantic analysis of both the 
written and spoken word could be 
applied to the expressions of figures 
from the past, for example, D. P: 
Boder's discovery that there are great 
variations in the use of adjectives and 
verbs in different kinds of writing, 
and also within the writings of indi- 
viduals over the course of time. 
Boder's study of Emerson's journals 
and the letters of William James 
brought out many interesting rela- 
tionships, such as the fact that the 
adjective-verb ratio in James's letters 
varied significantly according to the 
sex of his correspondent (13, pp. 
340-341). The literature in this field 
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has been summarized by F. H. San- 
ford (74, 75), who concludes: “Char- 
acterizations of style appear also to 
be characterizations of persons” (75, 
p. 198), a point which biographers 
could ponder with profit. 

There are other tools that might 
well assist the biographer in digging 
out the complexities of his subject's 
personality, for the most important 
developments in this area have gone 
beyond the counting of mere word 
forms. John Dollard and O. H. 
Mowrer have developed a means of 
measuring tension as expressed, often 
subtly, in personal documents (29). 
By counting the words, sentences, 
and “thought units" that express 
some form of discomfort (D), and 
those indicative of relief or reward 
(R), they have arrived at a Discom- 
fort-Relief Quotient (DRQ), which 
can be determined by the equation 
DRQ=D/D+R. Thus, hidden emo- 
tional states may sometimes be un- 
covered. That this device might be 
valuable to biographers seems obvi- 
ous. (See also 51.) 

Also of great potential usefulness 
to biographers is the study that 
Alfred L. Baldwin called “Personal 
Structure Analysis: A Statistical 
Method for Investigating the Single 
Personality" (8). Baldwin took a 
series of letters written by a single 
subject and classified them according 
to the goals, attitudes, and objects 
discussed. The resulting analysis was 
based on two principles: (e) The more 
frequently an item appears, the more 
important it is in the individual's 
personality structure, and (b) the 
mentioning together of items more 
often than would be likely according 
to chance indicates a relationship of 
these items in the personality of the 
subject (8, p. 168). Although a sim- 
ple reading of the letters pointed up 
many such relationships, the mechan- 
ical counting and sorting of the ma- 
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terial highlighted a number of others 
which made the letter writer's per- 
sonality (and behavior) more intel- 
ligible. If this technique was helpful 
to a trained psychologist, it would 
be even more useful for the average 
biographer. 

Closely related to the Baldwin 
study is the method of ‘‘value- 
analysis" developed by Ralph K. 
White (84, 85). This technique is ap- 
plicable to autobiographical writings; 
White developed it by studying 
Richard Wright's Black Boy. First he 
attempted to evaluate Wright's per- 
sonality on the basis of an ordinary 
reading of the book. Then he applied 
value-analysis, going over it again 
classifying and counting each value 
judgment made by Wright. He un- 
covered more than half a dozen rela- 
tionships, some of great importance, 
which he had missed in reading the 
book, but which, upon further 
thought, fitted well into the general 
picture he had formed (84). 

The particular value for biogra- 
phers of techniques like these is that 
they can be employed by nonspecial- 
ists. The proper recognition and 
classification of the material requires 
training it is true, but nothing like 
the work that must be put in by any- 
one seeking to use psychoanalysis. 
White claims that his method can be 
mastered in from ten to twenty hours 
of study and practice (84, p.441), 
surely not an unreasonably long ap- 
prenticeship for a writer interested in 
getting inside his hero. Since these 
methods are plainly supplementary 
ones and futile to apply unless com- 
bined with subjective study, they 
ought not to offend the sensibilities of 
those who object to the employment 
of "mechanical" techniques in ''ar- 
tistic" problems. Some of these 
methods ought to be given at least a 
trial by biographers. Surely, as Saul 
K. Padover has written, to “‘illumi- 
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nate the internal dynamics of a 
son" one requires “imagination, p 
chological insight, psycho-social 
niques” as well as the convention 
documents (67, p. 16). 
At the same time, life writers m 
beware of overemphasizing the "se 
entific” approach to personality; Thi 
biographer can learn from the] 
chologist, but he cannot redu 
presentation of personality to &- 
ence. 1 
In 1924, when the rage for W 
"new" Freudian psychology wal 
its height, the American Poli! 
Science Association appointed at 
mittee which, among other thim 
investigated the possible applicati 
of psychology to the science of gt 
ernment. The conclusion arri 
might well be accepted as the p 
judgment of the usefulness of 
chology in biography. ''Probabl 
general sentiment of the comm 
toward psychology,” reported cl 
man Charles E. Merriam, “woul 
expressed by the phrase, com amor 
ma non troppo" (61, p. 469). 


Just as the first biographers 
intuitive psychologists, so the t 
philosophers, considering the 
questions of human psychology, We 
influenced by what they knew of m 
who had lived before them as í 
by their own contemporary o 
tions. Of course, much of this. 
mation came from biographies. 
it was the publication of D 
Origin of Species in 1859 which 
led to serious study of the liv 
men of the past by psycholo 
Environmentalists saw in the pt 
ple of natural selection proof o! 
idea that man's surroundings 
shaped his development, and $ 
to trace its influence in the bi 
phies of former generations. 
who stressed the importan 
heredity were driven in the s@ 
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rection, for by the nature of things it 
was difficult to study the genetic 
transmission of human character- 
istics in any other way (79, p. 67). 

Since the records of ordinary men 
have seldom been preserved by his- 
tory, most of the work that was done 
dealt with the extraordinary—with 
the "great" men of former times— 
and sought in one way or another to 
deal with the question: What is the 
true nature of eminence? Thus, Fran- 
cis Galton, in his pioneering work 
Hereditary Genius, published in 1869, 
wrote: “Is reputation a fair test of 
natural ability? It is the only one I 
can employ . . . " (37, p. 37).* Gal- 
ton's investigations led him to con- 
clude that “men who are gifted with 
high abilities . . . easily rise through 
all the obstacles caused by inferiority 
of social rank," but that those who 
have all the social advantages cannot 
succeed “unless they are endowed 
with high natural gifts” (37, p. 43). 
Others, of course, have come to dif- 
ferent conclusions (2, 34, 76), but per- 
force have turned to the same kind of 
biographical evidence to “prove” 
their points. The methods developed 
by Galton and those who followed 
him in the study of eminent persons 
illustrate well the use that psycholo- 
gists have made of biography. 

In his studies, Galton drew upon 
biographical collections (Foss'si Lives 
of the Judges, Lord Brougham's 
Statesmen of the Reign of George III, 
Middleton’s Biographia Evangelica, 
and other "ordinary small biographi- 
cal dictionaries" which were them- 


‘Over fifty years of research did not pro- 
vide a better answer to Galton's question than 
his own. In 1926 Catharine C. Cox wrote in 
The Early Mental Traits of Three Hundred 
Geniuses: “Since the problem of the early 
mental development of great men cannot be 
approached directly, because of the lack of an 
objective criterion of greatness, it is ap- 
Proached indirectly through the study of 
eminent men...” (23, p. 31). 


politics, science, philosophy, 

arts (37, pp. 57, 106-107, 259, 324). 
In a similar way, William James in 
"Great Men and Their Environ- 


works, and reached conclusions that, 
in general, followed Galton's asser- 
tion that heredity was the dominant 
factor producing human eminence 
(46, 87, 88). Charles B. Davenport's 
Naval Officers: Their Heredity and 
Development, a practical search for an 
accurate method of selecting future 
naval officers on the basis of “juvenile 
promise,” made wide use of general 
compilations like Who's Who, 
Burke's Peerage, and the Encyclo- 
pedia Britannica, and of individual 
biographies whenever they were 
available (25, p. 3). 

Investigators who rejected the 
conclusions of Galton and his fol- 
lowers nonetheless worked with the 
same kind of sources. John Fiske’s 
reply to James's “Great Men,” called 
“Sociology and Hero-Worship: An 
Evolutionist’s Reply to Dr. James” 
(34), and a similar rebuttal by Grant 
Allen (2) drew from biographical 
information. 

These studies were all more or less 
haphazard in their choice of evidence 
but with the burgeoning of compre- 
hensive biographical dictionaries in 
the late nineteenth century more 
scientific statistical investigations be- 
came possible. This type of research 
was carried on by historians and 
biographers as well as by psycholo- 
gists (Henry Cabot Lodge's ‘‘Distri- 
bution of Ability in the U. S." [55] 
was perhaps the earliest work of this 
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kind), but the psychologists devel- 
oped the method most fully. James 
McKeen Cattell, who took an inter- 
mediate position in the debate over 
the importance of nature and nur- 
ture, compiled his well-known list of 
1,000 eminent men by computing the 
amount of space devoted to the indi- 
viduals described in six biographical 
dictionaries (18). The use to which 
Havelock Ellis placed the British 
Dictionary of National Biography and 
the "over three hundred biographies 
of individuals" which he read in order 
to supplement that massive collection 
in writing his Study of British Genius 
(31) was essentially similar to Cat- 
tell’s, although slightly more sub- 
jective. And even a much less 
scientific investigator, Cesare Lom- 
broso, whose The Man of Genius (56) 
attempted to demonstrate a close re- 
lationship between genius and in- 
sanity, based most of what he had to 
say on information obtained from life 
histories. 

The study of genius which made 
the most comprehensive use of biog- 
raphy was Catharine C. Cox's The 
Early Mental Traits of Three Hundred 
Geniuses (23). Selecting her subjects 
from Cattell's list, she sought facts 
indicative of the IQ's of these men in 
their personal writings and in the 
best available biographies. The re- 
sult was both detailed and convinc- 
ing, drawn from several thousand 
biographical works. 

Many other investigators of the 
nature of genius have also analyzed 
life histories. Evelyn Raskin, in her 
comparison of literary and scientific 
leaders (71), studied Lippincott's Bi- 
ographical Dictionary, the Encyclo- 
pedia Britannica and the biographi- 
cal evidence contained in numerous 
histories of science and of literature. 
Similarly, Edwin L. Clarke's Ameri- 
can Men of Letters: Their Nature and 
Nurture (22) and Cora S. Castle’s A 
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Statistical Study of Eminent Women 
(17) deserve mention in any list of 
works of this type, as do also the 
more recent Studies in Genius by 
Walter G. Bowerman (14), Mapheus 
Smith's "Racial Origins of Eminent 
Personages” (77), and R. K. White's 
“The Variability of Genius" (83). 

Another use to which biography 
has been put by psychologists has 
stemmed from the work of the psy- 
choanalysts. ^ Psychoanalytical life 
histories, already described as an 
aspect of psychology's influence on 
biography, also illustrate the reverse; 
in preparing case studies of historical 
characters, psychoanalysts have in- 
evitably drawn upon standard biog- 
raphies. These case studies? are, how- 
ever, essentially unlike psychoana- 
lytical biographies. They are more 
methodological demonstrations than 
attempts to explain the subjects con- 
sidered. They are meant to illustrate 
psychoanalytical principles, whereas 
psychoanalytical biographies use the 
principles only to explain the person- 
alities of the characters they describe. 

In the field of social psychology; 
some writers have made use of biog- 
raphy. Hadley Cantril, in his Psy- 
chology of Social Movements (16), 
studied works on Hitler to explain 
Nazism, and on Father Divine to ex- 
plain that individual's cult. A similar 
but far more elaborate example 0 
this type of work is offered by 
Gustave Gilbert's Psychology of Die 
tatorship (38). Although most © 
Gilbert's material was drawn from 
his interviews with captured Nazi 
leaders, his chapter on Hitler de- 
pended considerably upon biogra- 
phies of Der Führer. 

Other psychologists have turned to 
biography in an effort to discover 
patterns of human development. 

* The works of this type are so numerous 


that it would be fruitless to attempt even o 
abbreviated listing of them here. 
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Charlotte Bühler's work with over 
two hundred life histories led to her 
conclusion that most lives can be 
divided into five stages: youth, 
period of trials, maturity, achieve- 
ment, and decline (73). Similarly, 
Lehman's many studies of the years 
of peak performance in various fields 
of specialization illustrate an adapta- 
tion of biography to psychological 
purposes (see especially 54). 

Biography has also been developed 
as a teaching technique. R. M. 
Elliott’s course in ‘Biographical 
Psychology" at the University of 
Minnesota, in which each student 
studies biographical works about 
some person and writes a paper on 
"the success and lapses of insight 
with which the development of the 
human being is portrayed," has long 
been well known. Gordon W. Allport 
at one time adopted William E. 
Leonard's revealing autobiography 
The Locomotive God as a text for his 
course in psychology at Harvard (3), 
using it as *a hatrack for an entire 
course of study." At least one other 
psychologist, Henry Clay Smith, 
then at the University of Vermont, 
has followed Allport's lead. 


Psychologists have been neither as 
free nor as extreme in commenting on 
biography as biographers have in 
commenting on psychology, but they 
have expressed a considerable range 
of opinions. In general, those who 
have used biographies in their re- 
search have assumed the factual in- 
formation contained in them to be 
reasonably accurate, and have tried 
to follow the most "authoritative" 
authors and to cross-check their evi- 
dence whenever possible. But they 
have made numerous technical criti- 


1 Elliott, R. M. Personal communication, 
January 7, 1954. 

* Allport, G. W. Personal communication, 
December 30, 1953. 
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cisms. Biographers seldom realize, 
Havelock Ellis argued, that their dis- 
cipline demands "something more 
than pure literary aptitudes." His 
study of the Dictionary of National 
Biography led to him complain of a 
general lack of method among life 
writers. Often such important factors 
as whether or not a subject had ever 
married, his precise position in his 
family, and even so elementary a 
matter as a competent physical de- 
scription were not to be found (31, 
pp. 16-18). Cox has warned that the 
average biographer is a synthesizer, 
not an analyzer, from whose work the 
perceptive psychologist must strip 
every vestige of interpretation (23, p. 
24), and Evelyn Raskin has noticed 
that biographies of scientists fail to 
place enough emphasis on the per- 
sonalities of the individuals they de- 
scribe (71, p. 24). Alfred M. Tozzer 
has complained of the way many 
biographers distort or ignore the 
simplest laws of heredity in a frantic 
“cytological search for famous chro- 
mosomes" that will "explain" a 
man's fame in terms of the character- 
istics of his ancestors (80, p. 422). 

One of the strongest criticisms of 
biography was made by R. K. White 
after completing a study of the versa- 
tility of geniuses based on Cox's data: 

Biographical data of the usual type are by 
their very nature relatively unreliable. It is 
exasperating, for one who holds before himself 
a high standard of scientific precision, to be 
eternally conscious that his rock-bottom facts 
are not scientifically precise. On the con- 
trary, they are often snap judgments, made 
by untrained observers, often by prejudiced 
observers, without reference to established 
norms (83, p. 481). 

Finally, the comment of Lewis M. 
Terman should be mentioned. Ter- 
man decried "the utter inability ofa 
majority of otherwise cbmpetent bio- 
graphical writers to appraise and 
interpret .".. the early mental de- 
velopment of their subjects." He at- 
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tributed this to a lack of "detailed 
acquaintance with the age norms of 
childhood performance," He illus- 
trated his point by referring to Karl 
Pearson's life of Francis Galton, in 
which the author claimed that as a 
child Galton had demonstrated no 
significant signs of precocity, despite 
the fact that his book described 
youthful accomplishments that led 
Terman to believe that Galton's IQ 
was ''not far from 200" (23, p. v). 


In general, biography has not 
made the impact upon psychology 
which that relatively youthful disci- 
pline has made upon it. The enthusi- 
asm that was generated by Galton's 
Hereditary Genius has pretty gener- 
ally been dissipated by time. In 
other psychological areas, biography 
has been used frequently, but in 
rather isolated special cases (16, 38), 
or in experiments that have at- 
tracted the interests of single in- 
vestigators (54). 

But in addition to what biography 
can offer to the modern psychologist 
by way of source material, it can also 
provide him with many brilliant 
examples of character portrayal 
which demonstrate the importance of 
literary ability in any work seeking 
to explain that complex organism, 
man. 

After completing his exhaustive 
study of fifty men of college age, 
Henry A. Murray saw this point 
clearly: 

Any conceptual formulation of man's ex- 
perience . . . must necessarily do violence to 
human feelings. ... This will be so because 
it is the substitution of heartless, denotative 
referential symbols for the moving immediacy 
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of living. By employing such a scheme a per 
son's vital movements, once warm and pat 
sionately felt, become transformed into a 
cruelly commonplace formula, which dis 
possesses them of unique value.... The 
artist's representation of an experience, on the 
other hand, is a re-invocation of the original 
... equally immediate, exciting and intense 
(65, pp. 17-18). 


Murray therefore sacrificed some 
of the scientific objectivity of his 
studies on the altar of literary style. 
The same point has been made by 
Allport: “The greatest failing of the 
psychologist at the present time is his 
inability to prove what he knows to 
be true" (4, p. 9). Murray and 
Allport were suggesting that psy- 
chologists can learn from creative 
literature, but biography, with its 
stress on both artistry and accuracy, 
can, in its finest form, be a still better 
teacher. 

In conclusion, it seems clear that 
biography and psychology have al- 
ways been related disciplines. Early 
intuitive interrelationships have been 
supplemented in recent times by 
more conscious and more scientific 
efforts in each field to profit from the 
work of the other. Psychological 
knowledge has aided modern biogra- 
phers in understanding their sub- 
jects, and biographies have been u 
by psychologists interested in the 
study of genius, heredity, soci 
movements, and many other topics 
Yet there remains room for further 
development. Scientific psychologi- 
cal methods ought to be thoroughly 
investigated by biographers, while 
psychologists should more often turn 
to biography as providing a source o 
psychological data. 
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CLARIFICATION OF AN AMBIGUITY IN HULL'S 
PRINCIPLES OF BEHAVIOR 


J. J. CLANCY, L. THOMAS CLIFFORD, AND ALLEN D. CALVIN 
Michigan State College 


In Principles of Behavior (2), Hull 
attempted to set forth a rigorous 
hypothetical-deductive system. This 
tremendous step forward in psycho- 
logical system building has inspired 
more research since its publication 
ten years ago than any other book in 
the area of experimental psychology. 
For this reason any incongruity or 
ambiguity in one of the basic postu- 
lates or corollaries calls for prompt 
clarification. 

An example of such an ambiguity 
occurs in Corollary VII on page 
154. This corollary reads as follows: 
"Equal differences in the delays of 
reinforcement of two competing acts 
lead with equal practice to a lower 
per cent of preferential choices of the 
act associated with the shorter delay 
when the two delays are large (i.e., 
when the ratio is relatively coarse) 
than when they are small (i.e., when 
the ratio is finer)." 

Here Hull is obviously using the 
terms "coarse" and "fine" to mean 
that the larger the fraction the coars- 
er the ratio, i.e., 1:2 would be coarser 
than 1:3. However, in a preceding 
statement Hull notes, "Let us take, 

_ for example, the same theoretical ar- 
rangements assumed above, with the 
exception that the delays are of 30 
and 90 seconds respectively, which 
gives the organism the relatively 
coarse ratio of 1 to 3 instead of 1 to 
2, as previously" (p. 152). Again in 
his summary he reiterates, "More- 
over, it follows from these same prin- 
ciples that the point of maximum 
ease of discrimination of a coarse 
tatio, such as 1 to 3, will appear at 
smaller absolute values than will be 


the case with a fine ratio, such as 1 
to 2” (p. 159). This usage of coarse 
and fine stands, of course, in direct 
contradiction to the usage of these 
terms in the corollary cited above. 
We are thus faced with an ambigu- 
ous situation, which can be resolved 
in two possible ways. We can reverse 
the words "large" and "small" and 
change the corollary to read: Equal 
differences in the delays of reinforce- 
ment of two competing acts lead with 
equal practice to a lower per cent of 
preferential choices of the act associ- 
ated with the shorter delay when 
the two delays are small (i.e., when 
the ratio is relatively coarse) than 
when they are large (ie., when the 
ratio is finer); or we can reverse the 
words “coarse” and "fine" so that the 
corollary reads: Equal differences in 
the delays of reinforcement of two 
competing acts lead with equal prac- 
tice to a lower per cent of preferential 
choices of the act associated with the 
shorter delay when the two delays 
are large (i.e., when the ratio is rela- 
tively fine) than when they are small 
(i.e., when the ratio is coarser). 
These two alternatives are dia- 
metrically opposed in meaning. If 
we interchange the words large and 
small, as we did in the first alterna- 
tive, the corollary states that there 
will be a lower percentage of choices 
when the two delays are small, i.e., 
when the ratio is relatively coarse. If 
we interchange the words coarse and 
fine, as we did in the second alterna- 
tive, the corollary states that there 
will be a lower percentage of choices 
when the two delays are large, i.e., 
when the ratio is relatively fine. 
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Fortunately Hull provides us with 
the answer as to which is the proper 
alternative to choose. He states that 
this corollary finds empirical confir- 
mation in a study by Anderson (1). 
An examination of Anderson's data, 
which Hull presents, indicates that 
the second alternative is the one that 
is in accord with these findings. Thus 
it seems apparent that this corollary 
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In a recent issue of this journal, 
Burke (1) criticizes earlier discus- 
sions of one-tailed and two-tailed 
tests (3, 4, 5) and suggests need for 
caution in the application of one- 
tailed statistical tests to psychologi- 
cal research designs. The writer is 
in accord with several implications 
of Burke's note. As is true for most 
statistical designs, abuses would be 
reduced markedly were the test model 
completely specified and justified 
in terms of the purpose of investiga- 
tion, before data are viewed by the 
investigator. To be guided by the 
data in the specification of hypotheses 
and statistical tests is a grave breach 
of the rules of experimental verifica- 
tion (cf. 6). 

The argument presented by Burke, 
however, is more than a plea for care- 
ful consideration of the choice of test 
models for given experimental prob- 
lems. It is stated that the selection 
of a one-tailed test model requires 
that an investigator be willing to 
"publicly defend the proposition . . . 
of no difference" if results actually 
show a large difference in the direc- 
tion opposite to that predicted (1, p. 
387). The proposition appears in- 
defensible, since it demands arguing 
that a particular observed difference, 
no matter how large, is only a sam- 
pling departure from zero. If ac- 
cepted, Burke's argument should 
lead to universal avoidance of one- 
tailed tests. 

Consider the following experimen- 
tal problems, selected for simplicity 
as single variable designs: (2) On the 
basis of a certain behavioral theory, 
we might predict that an experimen- 


tal condition imposed upon subjects 
in the population under study would 
raise the mean level of performance 
on a given task. The theory provides 
a prediction (an alternative hypothe- 
sis) that the mean for the experimen- 
tal group population will exceed the 
mean for the control group popula- 
tion. The hypothesis under test is 
that the mean for the experimental 
population is the same as or less than 
that for the control population. We 
should like a statistical test that 
will yield a decision: either the data 
are consistent with the hypothesis 
under test, or we reject the hypothe- 
sis in favor of the alternative. (5) In 
a field of applied psychology a new 
diagnostic technique is developed and 
is to be adopted if, and only if, we 
are confident that it is better than the 
current technique which would be 
replaced. Assuming the availability 
of a suitable criterion, the two tech- 
niques are applied to comparable 
samples or to the same sample; 
interest resides in the extent to which 
the parametric proportion of success- 
ful predictions using the new tech- 
nique exceeds that using the old. 
A statistical test is to supply a de- 
cision: either the new technique is 
no more adequate than the old, or 
the new technique is more adequate. 

For the class of problems illus- 
trated by these two examples, the 
hypothesis under test is not simply 
one of no difference. We wish to test 
the hypothesis that the algebraic 
difference between parametric mean 
performance under experimental and 
control conditions is zero or negative 
against the alternative hypothesis 
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that the difference is positive. It 
is meant to stress this formulation of 
the one-tailed statistical test with 
the greatest possible emphasis. With 
this formulation, it is apparent that 
acceptance of the hypothesis under 
test does not demand defense of a 
proposition of no difference; the ob- 
served difference, whether negative, 
zero, or slightly positive, simply does 
not allow acceptance of the alterna- 
tive hypothesis at the level of strin- 
gency (values of a and 8) chosen for 
the test. 

Ina footnote, Burke (1, p. 385) criti- 
cizes this formulation as it appeared 
earlier (4, p. 45) on the grounds that 
it confuses hypotheses to be tested 
with hypotheses to be guarded against 
asalternatives. To the contrary, this 
statement of the problem clarifies the 
nature of the hypothesis under test. 
The hypothesis to be tested is! 


Ho: pe— He S0, 


where pe is the population mean for 
the experimental condition, p, is the 
population mean for the control con- 
dition, and the experimental predic- 
tion yields an alternative hypothesis, 


Hi: pe~ h> 0. 


Burke’s primary argument seems 
to rest upon the contention that 
“there is, under the single-tailed rule, 
no safeguard whatsoever against oc- 


1 An equivalent formulation of the one- 
tailed test model, of which the writer was un- 
aware at the time of his earlier note, is that 
tee by Dixon and Massey (2, pp. 100- 
104). 
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casional large and serious errors 
when the difference is in the unex- 
pected direction" (1, p. 385). Our 
formulation of the hypothesis under 
test completely resolves this diffi- 
culty, for no error is committed when 
that hypothesis is accepted on the 
basis of a large observed difference 
in the unexpected direction. The 
event is one of a class of events 
consistent with the hypothesis tested, 

If one were to retain the alterna- 
tive hypothesis, Hı, above, and to 
adopt a two-tailed statistical test, 
accepting H, when there were ob- 
served large differences between the 
means in either direction, his position 
would be unenviable. For, following 
the rules of his test, he would have 
to reject the hypothesis under test 
in favor of the alternative, even 
though an observed difference, pe~ He 
was a substantial negative value. 

The remaining discussion by Burke 
consists of pragmatic arguments 
against the adoption of one-tailed 
tests. The arguments appear valid 
only under the assumption that every 
application of the one-tailed test isan 
abuse of experimental methodology. 
Certainly, if (a) the test model is 
specified completely (including spec 
fication of the confidence level to be 
adopted) before the data are gath- 
ered, and if (b) the purpose of the test 
is only to determine whether a par- 
ticular directional prediction is sup- 
ported by the data, then the one- 
tailed test not only is appropriate 
but it is in error to use a two-tal 
test model. 
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In the discussion of one-tailed 
tests there are a number of issues 
which have not been clearly sepa- 
rated. An effort will here be made to 
separate them. 


Two TYPES or MODELS 


Psychologists have used two dif- 
ferent types of models in interpret- 
ing their data: statistical models and 
specifically psychological models. The 
two types have been employed in 
different ways and for entirely dif- 
ferent purposes. Statistical models 
have been used to determine whether 
data are intrinsically interesting or, 
in other words, whether the data 
indicate relationships between psy- 
chological variables. Psychological 
models have been utilized in attempts 
to organize the knowledge attained 
in various areas of the field. 

Early uses of statistical methods 
mixed the psychological and statisti- 
cal models, The substantive biases 
of the experimenter led him to seek 
out, with each set of data, those 
statistical techniques which sup- 
ported his views and to neglect the 
techniques which did not. Gradually, 
it was recognized that the function 
of the statistical model was prior 
to and separate from the function of 
the substantive model. Cognizant 
of the fallibility of their data and 
the greater fallibility of their theories, 
psychologists developed the view that 
data should be interpreted without 
the intrusion of the biases, however 
well founded, of the experimenter 

1 The writer thanks his colleagues, A. M. 


Binder and W. K. Estes, for their suggestions 
after reading an earlier draft of this reply. 


and accordingly accepted certain 
statistical procedures as conventions. 
The advocates of one-tailed tests 
would take us a large step backward, 
for they openly favor the mixing of 
the statistical and substantive mod- 
els. 

Any psychological theory worth 
bothering with will generate a number 
of predictions which can be directly 
checked against experimental data. 
Such direct determination of the 
consistency of data with the model is 
the only proper test of a psychologi- 
cal theory. In a check of this kind, 
there is a point by point verification 
of the theory, and strictly speaking, 
no statistical model is necessary. 
This does not mean that there should 
be no statistical interpretation of the 
data; it is usually wise to make the 
ordinary statistical interpretation 
prior to the theoretical check—to 
determine whether the data show 
enough to warrant a theoretical 
analysis. If, in any given experi- 
ment, a theory provides only a direc- 
tional prediction, there seems to be 
nothing wrong with our traditional 
procedure of first establishing the 
presence of a difference between 
groups and subsequently noting its 
direction. A statistical model should 
enter directly into the verification of 
a substantive theory only when it is 
based on a null hypothesis with re- 
spect to the residuals of the data from 
theoretical predictions. 

A statistical model can be viewed 
as a technique for assessing the relia- 
bility of an experiment without re- 
peating it. The question answered 
by a statistical test is whether or 
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not the sample obtained is a member 
of a certain class of samples. This 
class is usually defined by the null 
hypothesis to be tested and by the 
conditions under which the experi- 
ment would be repeated, if repetition 
were undertaken; frequently certain 
supplementary parametric assump- 
tions must be made. Since an experi- 
ment can be repeated without any 
appeal to a body of substantive 
theory, no such appeal need ever be 
made for purposes of statistical analy- 
sis, Experiments are often designed 
from theoretical considerations, and 
the conditions of repetition may, 
therefore, be dictated by a substan- 
tive model. In some analysis of vari- 
ance designs there may be freedom 
in the choice of an error term. This 
freedom indicates that the single 
experiment which has been per- 
formed, even in conjunction with a 
null hypothesis, does not define a 
unique class of repetitions. When 
this is the case, appeal to substantive 
theory may lead to the choice of a 
proper error term, and this appeal 
may seem to involve a direct intru- 
sion of substantive theory upon the 
statistical interpretation. As we 
have put the matter, however, it is 
readily seen that in this instance the 
theory does no more than define the 
conditions under which the experi- 
ment would be repeated. 


Tue ONE-TAILED TEST IN 
'THEORETICAL STATISTICS 


Properly described, the concept of 
a one-tailed test is clear and free 
from any objection on mathematical 
grounds. There is no disagreement on 
this point. Disagreement on what 
constitutes a proper description of 
a one-tailed test centers specifically 
on the nature of the hypothesis that 
is tested. Jones has emphasized his 
conviction that the hypothesis of a 
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nonpositive difference is tested 
against the alternative of a positive 
difference. I have stated that a null 
hypothesis is tested with safeguards 
against only a subset of the possible 
alternatives. 

As it now stands, the mathematical 
theory of testing statistical hypothe- 
ses requires that the hypothesis un- 
der test lead to the calcu'ation of a 
level of confidence. (Un:il the on- 
slaught of the one-tailers, books in 
psychological statistics cautioned us 
to test only "exact" hypotheses.) 
If we ask Jones to exhibit his calcula- 
tion of the level of confidence in a 
one-tailed test, we shall probably 
discover that he bases the calcula- 
tion on the distribution obtained 
under the null hypothesis. Then, 
according to the usual statistical 
logic, he is in fact testing the null 
hypothesis against selected alterna- 
tives. The comments of my earlier 
note were made within this frame- 
work. 

An easy modification of statistical 
logic can make the one-tailed test 
of a nonpositive difference tenable. 
Since the probability that he rejects 
the hypothesis of a nonpositive dif- 
ference when it is true can, in his 
procedure, never exceed the level of 
confidence calculated under the null 
hypothesis, Jones can declare that he 
has a bound on his level of confidence 
even though he does not know its 
precise value. Utilizing this bound, 
he can set up consistent rules for one 
tailed interpretation of the hypothe- 
sis he has stated. 

The two paragraphs above present 
what seem to be the only two post 
tions open to Jones. Whichever he 
accepts, I am in complete accord with - 
him so far as mathematical statistic 
goes, for both are : 
cally defensible. Yet his succinct 
assertion that I advocate unive 


i 
3 
clear and statisti- 


avoidance of one-tailed tests is cor- 


‘rect. The locus of disagreement is 
'to be found in practice, not in theory. 


APPLIED PROBLEMS 


In applied problems like Jones's 
example of the drugs, it is often true 
that certain practices or routines will 
‘be changed if, and only if, an experi- 
ment yields a reliable difference in a 
Specified direction—if the new is reli- 
ably better than the old. For prob- 
lems like this, Jones maintains that 
the two-tailed test is inappropriate. 
‘Verbally his argument proceeds well, 
but before we accept it, let us note 
that problems of this kind have never 
given any difficulty—we have always 
understood what we did and why. 
Let Jones use his one-tailed test at 
the 5 per cent confidence level to 
analyze data from a large number of 
such experiments. We shall analyze 
the same data using a two-tailed test 
of the null hypothesis but permitting 
ourselves a luxuriously loose 10 per 
cent confidence level. Being sensible 
men, we shall not advocate a change 
to the new routine if the old one 
proves reliably superior. Unless we 
wish to differentiate a null from a 
negative difference (we might, for 
example, consider a replication neces- 
sary unless the null hypothesis were 
rejected), the two procedures will 
obviously lead to the same recom- 
‘mendations with every set of data. 
Thus, they have identical conse- 
quences for the immediate practical 
decision. 

For myself, I do not much care 
"Whether one uses a one-tailed or two- 
tailed model in such applications. 
Since the choice of a level of confi- 
dence is arbitrary and since identical 
decisions will always be made, we 
have complete equivalence between 
the one-tailed model and a doubled 

level of confidence in our traditional 
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procedure. It would be wrong for 
either Jones or me to pretend that 
compelling reasons exist for preferring 
one of these ways of talking about 
applied experiments over the other. 
Any appeal to the risks of the two 
kinds of errors is little more than an 
uninteresting parlor game. Since no 
gain results, there seems to be in- 
sufficient reason for changing the 
traditional two-tailed interpretation, 
but this is not a very strong objection. 
In my earlier note, I conceded the 
admissibility of the one-tailed test 
for the experimenter's own laboratory 
planning; I am quite willing to add 
the applied experiment to the list. 


Tue PUBLIC LITERATURE 


There do seem to be compelling 
reasons why the use of one-tailed 
tests in our permanent reports must 
be rejected. These reasons are of two 
kinds: gne kind residing in the nature 
of the scientific enterprise and the 
functions of the experimental litera- 
ture, and the other associated with 
the debasing effect upon the litera- 
ture that is almost certain to result 
from adopting one-tailed tests. 

Experimental scientists must have 
for data a permanent respect that 
transcends their passing interest in 
the stories they make up about their 
data, Science is a public enterprise. 
These are statements to which most 
experimental scientists would sub- 
scribe. One-tailed tests violate both 
the spirit and the letter of these 
statements. Interest in whether a 
relationship is found between two 
variables in a given experiment is 
seldom confined to those who share 
the theoretical preconceptions of 
the experimenter, and the right to 
discuss experimental data in rela- 
tion to a particular theory does not 
remove the obligation to interpret 
the experiment according to rules 


590 


that can be accepted by the reader 
who rejects your views as well as by 
the reader who shares them. Our 
experience has shown that experimen- 
tal results can be viable for years 
after they are first reported even 
when the experiments are designed 
on the basis of moribund theoretical 
considerations. Let us consider our 
obligation to future psychologists who 
might not be able to understand why 
we were so naive in 1954 as to pre- 
dict that direction for our experi- 
mental results. Why should we put 
needless difficulties in the way of 
any reader, present or future? 

Why, particularly, when the 
change over to one-tailed test is al- 
most certain to gain nothing and lose 
much? Jones asserts that my dire 
forecasts on the effects of one-tailed 
tests on our literature are predicated 
on the assumption that every one- 
tailed test is a misuse. Here he has 
missed the fundamental point of my 
argument. Asin applied experiments, 
there is pragmatic equivalence be- 
tween using the one-tailed test and 
doubling the level of confidence in 
the two-tailed test, for if the discrep- 
ancy is in the expected direction, the 
power of these two tests is identical, 
and experiments should be, and usual- 
ly are, set up with power as a fore- 
most consideration. 

Most psychologists would resist 
the general adoption of the 10 per 
cent level for rejecting hypotheses. 
Such a low level of aspiration would 


jects per experiment and th 
complish a gradual attrition in 
soundness of our reported conc 
sions. This is precisely what will ha 
pen if there is general, and edite 
acceptance of the one-tailed t 
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A diminution in the quality of¢ 
psychological literature may be 
little moment to the theore 
statistician but must be of 
concern to the experimental psy 
ogist. Once they have the a 
of the mathematical statisticiat 
the procedure is logically defen 
experimenters seem so eager to! 
ploy one-tailed tests that, sociol 
cally speaking, I have lost the 
ment almost before it has been 
Of course, the procedure is n 
matically sound, but psycho 
are mistaken if they believe 
mathematical statistician can spe 
with special authority in th 
The decision is for us alone; it 
us to ponder whether, in the in 
tation of our data, the one-ta 
test is wise. We are drifting inte 
unfortunate decision, and I, for © 
wish to enter a plea for a little | 
methodology and a little more W 
dom. We expound a distinction. 
tween experimental hypotheses 
statistical hypotheses to our Deg 
ning students; let us maintain. 
distinction for ourselves. 
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Woopwortn, R. S., & ScutossERG, 
H. Experimental p . (Rev. 
Ed.) New York: Holt, 1954. Pp. 
xi--948. $8.95. 


It is a pleasure indeed to note the 
appearance of this revised edition of 
one of psychology's classics, Wood- 
worth's Experimental Psychology, 
which has graced the shelves of every 
graduate student for a generation, 
In a real sense the tradition is even 
longer, for the spirit of this revision 
is very much the spirit of Ladd and 
Woodworth's Physiological Psychol- 
ogy which an older generation will 
remember (and perhaps that of the 
still older Ladd, which this writer 
willingly confesses he does not re- 
member!). The revised edition is a 
concretely objective account of the 
principal facts discovered by psy- 
chologists in the areas of sensation 
and perception, learning, and the 
emotions, with frequent appeal to 
established physiological mechanisms 
for explanation. 

It should be emphasized at once 
that this is truly a revision and not 
just a freshened-up second edition. 
While the range of material, and even 
many of the chapter headings corres- 
pond closely to the Woodworth book, 
the treatment of some topics has 
undergone very substantial change. 
Learning, in particular, has been 
almost completely reorganized. Three 
of the old chapters have disappeared: 
Retention, Memory for Form, and 
Practice and Skill. Three new chap- 
ters have been added: Introductory 
Survey, Discrimination Learning, 
and Motivation in Learning and 
Performance. New names appear: 
Hull, Spence, Skinner, Tolman, Hil- 
gard, Hovland, while older names 


take second place, G. E. Miller, 
Ebbinghaus, Pavlov, and Carr. 

Changes in other sections of the 
book are less drastic. Feeling, Ex- 
perimental Esthetics, and Reading 
are not given separate treatment, 
and the chapters on Problem Solving 
and Thinking are combined into one 
shorter chapter. The subject of emo- 
tion is given a new, substantially 
more physiological, treatment. Of 
the sensory chapters, Vision, in par- 
ticular, reflects the renewed emphasis 
which the war gave to this side of 
experimental psychology. 

It is always tempting to a reviewer 
to seek some measure of the amount 
of change which has been made in 
revising a book. In this case, a list 
was made of the most frequently 
cited (indexed) authors. Highest ten 
in the 1938 book were, in order, G. 
E. Müller, Wundt, Ebbinghaus, 
Köhler, Cattell, Dodge, Judd, Thorn- 
dike, Pavlov, and E. H. Weber. Of 
these, only Thorndike survives in 
the 1954 list. The list now runs 
Hull, Boring, Hilgard, Thorndike, 
Stevens, Lashley, Graham, Wever, 
Hovland, Tolman. Quite a change! 

The present book is roughly 25 
per cent larger, yet less attention is 
given to any one person. This makes 
doubly surprising the fact that Hull 
is cited almost twice as frequently 
as any single person in the 1938 edi- 
tion. If anything is suggested by 
this tabulation, it is that there is 
considerably more dependence in 
this book on secondary sources. The 
frequent references to Hull, Boring, 
and Hilgard, for instance, are rarely 
to original papers, but much more to 
their organized judgments about the 
facts in a given experimental field. 
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The critica: review of a book of 
this kind is not easy. It contains too 
many detailed facts. The perspec- 
tives are rarely global. There is the 
added uncertainty of which of the 
two authors has seen a problem in a 
given way. It would be easy, there- 
fore, to add some comment at this 
stage recommending this book for 
its undoubted excellence, and ad- 
vising teachers and librarians that it 
merits their special attention. Both 
comments are true. But the real test 
of these propositions lies in the popu- 
larity which Experimental Psychology 
has enjoyed over the past 16 years, 
and in whatever degree of acceptance 
the revised edition may find in the 
next decade. It is of much more 
interest to see the kind of reflection 
which the book affords us of the con- 
temporary state of affairs in experi- 
mental psychology. 

1. The strongest impression of this 
reviewer is the striking unevenness in 
the development of various parts of 
the field. Some of this may proceed 
from the very catholicity of interest 
on the part of the authors. Or it 
may be that there are larger gaps be- 
tween older and younger generations 
of psychologists than some of us 
would care to admit. Nevertheless, 
the diversity in approach, in sophis- 
tication as to method, and in the 
formalization of results strikes the 
reader repeatedly in trying to read 
such a book as this from front to 
back. The first three chapters (skip- 
ping an Introduction) on Reaction 
Time, Association, and Attention 
present almost sheer anachronism. 
The problems, objectively conceived, 
certainly are problems with which the 
experimental psychologist wants to 
deal, but scarcely in this intellectual 
framework. But the authors' trouble 
was obviously how to recast the older 
facts to fit them to a currently ac- 
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ceptable pattern. Where, for in- 
stance, is the body of modern evi- 
dence to which old facts about set 
and attitude and the selectivity of 
attention should be related? 

To take another contrast, there is 
the curious difference between the 
field of learning on one hand and the 
sensory fields on the other. There 
is no question that great changes 
have taken place in the field of learn- 
ing. To this the revised edition is 
ample testimony. Neither “rein- 
forcement" nor the "law of effect" 
can be found in the 1938 index; 
their omission in the 1954 book would 
be unthinkable. And yet the field 
of learning is full of doctrinal con- 
troversy. Is there one, or are there 
two kinds of conditioning? Can we 
depend on the shape of the generaliza- 
tion gradient? What about the con- 
flicting views of the nature of extinc- 
tion? In short, theory building and 
the effort at semantic clarification 
seem to be the only analytic tools at 
hand. In contrast, nothing frightens 
the sensory psychologist so much as 
a bit of theory. His experiments are 
brutally detailed ; their plan, entirely 
parametric. For explanation, he takes 
flight into physiology. Is this, rather 
than theory building, the syndrome 
of a mature science? Certainly, a$ 
presented in the revised edition, these 
two areas of interest follow an utterly 
different pattern, and the difference 
is puzzling indeed. 

2. Not unrelated to the uneven- 
ness in development and in metho 
is the lack of an overview of what 
psychology's problems are, where they 
come from, and to what consequences 
they lead. Like Topsy, most of the 
questions in this book “just growed. 
For the expert or for the technician 
this deficiency is unimportant. He 
does not go to a factual compendium 
such as this to answer the question 


BOOK REVIEWS 


about why, as a psychologist, he 
wants to know this or that. But it is 
not entirely unimportant for the 
graduate student who reads a treatise 
of this kind. He is more likely to be 
forming some notion of the way in 
which an experimental psychologist 
regards human experience and human 
behavior. One thing this book makes 
clear—the methods to be followed 
are experimental. Concrete methods 
are repeatedly detailed. But often 
the reader, at least this reader, gets 
the impression that we drift into 
experimental work on an issue with- 
out keeping our original objectives 
in mind. 

Let us take the question of the 
GSR and emotion. Very roughly 
the conclusion seems to be that the 
GSR is probably related to some- 
thing called the level of activation. 
Or at least the GSR reflects changes 
in the level of activation. But the 
GSR does not have much to do with 
what has traditionally been called 
emotion and that was the broad 
class of behavior with which the dis- 
cussion started. Perhaps the term 
emotion is just another dead dodo, 
but the reader may feel that the GSR 
is a Trojan horse that takes the field 
without a battle. Too often, the dis- 
cussion slips from a problem to a 
method, and from the method to 
controversy about the method. This 
latter is not always psychology. 

3. When psychology deals with 
questions asked as often for method- 
ological as for substantial reasons, 
systematic comparisons of the find- 
ings in related fields are likely to be 
lacking. For instance, a great deal 
of the discussion of vision hinges 
around stimulation with short flashes 
of light. The parallel experiments on 
the temporal course of stimulation 
in other sense modalities come in for 
rather little discussion and the possi- 


593 


bility of a common underlying mecha- 
nism does not come up. The same 
kind of question is, however, recog- 
nized with regard to adaptation—and 
remains unanswered, Perhaps the 
contrast is most striking if one recalls 
the almost overelaborate treatment 
of interrelated experimentsin Koffka's 
Principles of Gestalt Psychology, or 
the parallel effort in Hull's Princi- 
ples of Behavior. It is indeed a wise 
man who can steer the way between 
the Scylla of theoretical vacuity and 
the Charybdis of a mere taxonomy. 
This reviewer would argue that the 
authors have sailed a bit too close to 
the whirlpool. 

It would not be proper to close 
this review without a word of ap- 
preciation for the real scholarship 
that lies behind this revised edition, 
quite as much as it did in the case of 
the 1938 Experimental Psychology. 
Everyone who sits down with the 
book, though he will miss some de- 
tails of topics near to his own inter- 
ests, may as often find the discussion 
cogent and complete. But rarely, 
very rarely, will some mention of an 
important line of research be entirely 
lacking. Within certain limits as to 
subject matter, there is amazingly 
little that has not been at least con- 
sidered by the able authors. This in 
itself is no mean achievement. 

Epwin B. NEWMAN 

Harvard. University 


WEITZENHOFFER. A. M. Hypnotism: 
an objective study in suggestibility. 
New York: Wiley, 1953. Pp. 
xvi+380. $6.00. 


A good deal of experimental work 
in the fields of suggestibility and hyp- 
nosis has been done since Hull pub- 
lished Hypnosis and Suggestibility: 
An Experimental Approach. A num- 
ber of books have been published in 
this field too, but they have usually 
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been of a speculative character or 
have dealt only with limited aspects 
of the field. Hypnotism: An Objective 
Study in Suggestibility is the first real 
attempt to pull together all the avail- 
able experimental material, to assess 
it critically, and to draw whatever 
conclusions may be justified on the 
basis of the findings. The attempt 
was well worth making, and the sci- 
entific care and objectivity of the au- 
thor have succeeded in making this 
an indispensable book for anyone 
interested in the theory and practice 
of hypnosis, or doing research in this 
field. 

The book is divided into four parts, 
the first of which gives a general ori- 
entation and deals with such topics as 
the criteria and stages of hypnosis, 
tests of suggestibility, and so forth. 
The second part deals with experi- 
mental studies of the intrinsic and 
part 3 deals with the extrinsic char- 
acteristics of suggestibility and hyp- 
nosis. This differentiation is not 
clearly defined by the author. By 
intrinsic he seems to mean such things 
as the factors influencing suggesti- 
bility, the interrelationship of hypno- 
sis and suggestibility, and ''animal 
hypnosis." By extrinsic he seems to 
mean the effects of suggestion and 
hypnosis on sensory and perceptual 
functions, on memory and learning, 
on organic functions, and so forth. 
Some of the chapters included in one 
part, however, would seem to fall 
more easily into the other, if this be 
indeed a correct interpretation of the 
author's wish; altogether this division 
does not seem to be a very happy de- 
vice. The last part deals with a re- 
view of theoretical formulations and 
presents the author's own theory. 

Weitzenhoffer accepts the division 
of suggestibility into primary, sec- 
ondary, and tertiary, and builds his 
account around these notions of inde- 
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pendent factors. In doing this, how- 
ever, he is not always consistent, 
Thus, on page 33 he quotes with ap- 
proval data showing that neurotics 
are more suggestible on a test of pri- 
mary suggestibility; a few lines later 
he quotes another investigator's re- 
sults showing that neurotics are more. 
suggestible on a test of secondary sug- 
gestibility. Weitzenhoffer concludes 
that the latter confirms the earlier - 
work. Now if primary and secondary 
suggestibility are indeed independent 
factors, a test of one type cannot it- 
self confirm results obtained from a 
test of the other type. This tendency 
to have his cake and eat it too (i.e 
to regard two factors as independent, 
but yet to assume that tests of one 
type can confirm results obtained by 
tests of the other) occurs several 
times throughout the book and mx 
be misleading to readers not familiar 
with the meaning of orthogonal fac-- 
tors. 

On most occasions, Weitzenhoffer's 
conclusions from the evidence are un- 
exceptionable and will be accepted by 
most students of the subject. O 
sionally, however, he seems to mis: 
the mark. Thus, in his summary © 
the literature dealing with the rela: 
tionship between neurosis and sug 
gestibility he ascribes the contra 
tions found to the fact that di 
ences are observed when psychiatri 
diagnoses are used, but not wh 
neuroticism inventories are used. A 
different summary might be that 
ferences are found when severe 
rotics are compared with no 
people, but not when we restrict t 
range so much that only normals 
included in the sample. 

The documentation of the boo! 
very thorough indeed, the bi 
raphy running to over five hun 
titles, all well selected. The re 
has been able to find only one maj 


omission that would alter the conclu- 
sions Weitzenhoffer draws. In his 
discussion of hypnosis and dissocia- 
tion (p. 99), Weitzenhoffer quotes the 
work of Mitchell, which is negative, 
but not that of Nagge, which is posi- 
tive. Since it is derived from a larger 
number of cases, the latter appears 
more trustworthy and would have 
altered the conclusion drawn if it had 
been considered. This is important, 
for it determines the author's atti- 
tude to the dissociation hypothesis. 

On much disputed points like the 
existence of animal hypnosis, or the 
possibility of inducing antisocial acts 
under hypnosis, the author sums up 
with impartiality and bases his con- 
clusion entirely on the evidence with- 
out showing any tendency to try to 
make the evidence agree with pre- 
formed conclusions. This tendency to 
follow the facts rather than to try to 
press them into the service of some 
general theory makes this quite an 
unusual book in the field. 

The major new contribution of the 
book lies in the theory advocated by 
Weitzenhoffer to account for the 
phenomenon of hypnosis. This is a 
three-stage theory: “Hypnosis is the 
outcome of not one but of several tem- 
porally overlapping and interacting 
processes." The two main ones the 
author considers to be those of homo- 
action and generalization; in the 
later stages of hypnosis a third 
process, namely a marrowing of con- 
sciousness, is introduced. “All of 
these complement each other, and are 
interwoven, although each appears to 
have a different mechanism of action 
and different end results when con- 
sidered singly." This theory is too 
complex to be considered in any de- 
tail here. It seems to have the usual 
advantages and disadvantages of 
theories involving the interaction of 
several processes, none of which can 


be specified or measured with any de- 
gree of accuracy. It would probably 
be possible to account for most hyp- 
notic phenomena along the lines of 
this hypothesis, but it would be very 
difficult to make predictions which 
could invalidate it. Another weak- 
ness of this theory seems to be that 
although certain aspects of the theory 
involve learning, and presumably also 
drive reduction, the author has no- 
where tried to bring his formulation 
into line with modern learning theory. 
Such an attempt might have resulted 
in testable predictions, thus strength- 
ening the acceptability of the theory. 
In spite of these criticisms, theoreti- 
cal formulations in this field have 
generally been so obviously untena- 
ble that Weitzenhoffer's formulation 
stands out as a much more reasonable 
interpretation of the very complex 
phenomena under discussion than 
any alternative theory known to the 
reviewer. Perhaps it would be unfair 
to ask for more than that at this 
stage of progress. 
H. J. EvsENCK 
University of London 
Rog, ANNE. The making of a scien- 
list. New York: Dodd, Mead & 
Co., 1953. Pp. ix--244. $3.75. 
Since scientific discoveries have 
played a central role in initiating the 
vast changes in human life which 
have characterized the past two cen- 
turies, one cannot doubt the impor- 
tance of obtaining better understand- 
ing of creative investigators. Dr. 
Roe for several years has been con- 
ducting studies of research scientists. 
She has obtained test data as well as 
biographical and personal interview 
data from a considerable number of 
the leading scientific men of today. 
Her original reports of her findings 
have appeared in monograph form 
and in psychological journals. The 
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present volume is a summary ac- 
count of her investigations written 
for the layman. It contains no new 
data. The style is informal and easy. 
However, the reviewer is inclined to 
believe that most readers who are 
interested in "the making of a scien- 
tist," will find the presentation too 
elementary. Furthermore, if the 
reader should become interested in 
delving further into the character- 
istics of scientists, this book will not 
assist him, for he will find in it few 
references to other publications, in- 
cluding Dr. Roe's own research re- 
ports. This is unfortunate, for many 
psychologists, as well as some general 
readers, would find a bibliography 
useful. 

The critical reader will doubtless 
take issue with some of the conclu- 
sions which Dr. Roe arrives at from 
studies of quite small groups of sub- 
jects whose representativeness is 
unknown. However, all in all, Dr. 
Roe is to be commended for taking 

. the pains to make available to inter- 
-ested laymen an account of her work. 
Psychologists too seldom undertake 
to communicate with other than 
psychologists. 
WAYNE DENNIS 
Brooklyn College 


DOLLARD, JOHN, AULD, FRANK JR., 
& WHITE, ALICE. Steps in psycho- 
therapy. New York: Macmillan, 
1953. Pp. ix+222. $3.50. 


For reasons which are no longer 
valid, psychotherapists for over half 
a century have failed to make avail- 
able for public inspection the raw 
data from which their hypotheses and 
conclusions have been derived. The 
procedure of relying upon the sub- 
jective interpretations of an ego- 
involved therapist-investigator is 
gradually being replaced, fortunate- 
ly, by typescripts of phonographic 
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recordings. Although the pre 
book presents only one ve 
interview from a series of 17 
with a patient, with the rem 
interviews being abstracted fro 
recordings, the reader gains a defini 
impression of intimacy not only 
the patient but also with the therae 
pist's thinking about the patient. 
addition, the verbatim commei 
of the supervisor appear on the 
page in which the interview ma 
is presented. Thus the book presenti 
not only the interpretations and coi 
clusions of the authors about à 
year-old woman treated by 
psychoanalytically oriented ps 
therapy, but also includes a good 
of the raw data of treatment obtain 
not only from the therapist, but 
from the patient and the supervi 
A final section of the book pret 
a verbatim account of several 
administered to the patient prior. 
therapy which were interpre 
blindly and independently by two 
clinical psychologists. The autho 
attacks upon the psychological 
ports are somewhat unconvincing 
view of the limited testing and 
rapport with the patient d 
examination. i 
Apparently intended as some 
of a casebook for the earlier boo 
Dollard and Miller, Personality 
Psychotherapy (1950), the introd 
tory chapters of Steps in Psychothe 
apy summarize very briefly 
learning theory interpretation of. 
therapeutic process and provide | 
extremely lucid résumé of the th: 
peutic principles presented in 1 
earlier book. Although the 17 im 
views are intended to illustrate 
principles expounded, the rele 
of the learning theory is not n 
particularly discernible in the sup 
visor's extended comments. 
learning theory appears essentiall: 


be an apologia for the particular 
brand of brief analytic therapy which 
is presented. Only in the continued 
insistence on the therapist's duty to 
help the patient to "label" his ex- 
periences does there seem to be much 
extension of or departure from the us- 
ual procedures of brief, analytic treat- 
ment methods. 

Regardless of one's point of view 
toward the brand of therapy advo- 
cated, the interested reader is in a 
position to inspect the supervisor's 
hypotheses and, to some extent, con- 
sider the evidence on which they were 
based as well as the effect the hy- 
potheses had upon the patient when 
the therapist followed instructions. 
It is regrettable that space limitations 
prevented the authors from publish- 
ing more than selected and edited 
excerpts of the 16 interviews. Such 
questions as the following indicate 
the possibilities of using the book as 
a teaching device, although the an- 
swers can't be found in the book be- 
cause of the limited material pre- 
sented: (a) Was the early diagnosis of 
sexual maladjustment as the major 
problem justified, or did it stem from 
the theoretical bias of the supervisor? 
(b) Was the patient rewarded pri- 
marily for discussing sexual material 
and "punished" for other topics of 
discussion which might have been 
important? (c) Did the mother and 
husband play such minor roles in the 
patient's problems as the authors 
indicate? 

In a charming and disarming sen- 
tence the authors "expect to find 
most helpful the views of those com- 
mentators who have themselves pub- 
lished their own uncensored remarks 
as therapists or supervisors." Al- 
though the drive revealed by this cue 
is commendable in a world where 
justice should reign supreme, the ex- 
pectation is hardly cricket since it 
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reduces the universe of “qualified” 
commentators to perhaps five or six 
persons now living. 

This reviewer will rest content with 
the observation that the edited case 
material fastens so insistently upon 
the sexual conflict theme that there 
is little opportunity for the reader to 
determine whether any alternative 
explanations of the patient's symp- 
toms might have been tenable. Such 
a question does not, of course, de- 
tract from the real value of the book 
in illustrating a particular approach 


to therapy. 
Victor C. Ramy 

University of Colorado 
Picxrorp, R. W. Individual dif- 

ferences in colour vision. London: 

Routledge & Kegan Paul, Ltd., 

1951. Pp. xviii--386. 30s. 

This book is primarily a detailed 
report of seven experiments on color 
vision carried out over a period of 
more than eight years with some 1100 
observers. The research was designed 
to develop a diagnostic test of color 
vision, to help clarify the existing 
confused classifications of color blind- 
ness, and to test the adequacy of the 
Young-Helmholtz theory of color 
vision. 

These aims were achieved with 
considerable success. Pickford's four- 
color test, which was used in his main 
experiment, turns out to be one 
which can be readily adapted to the 
clinical testing situation to give ob- 
jective diagnostic information. His 
test data provide an operational 
basis for defining categories of nor- 
mality, of color weakness, of anoma- 
lies, and of major defectives. His re- 
sults point strongly toward a theory 
of color vision much like Houston's 
modification of the Hering theory, 
and can be fitted to the Young- 
Helmholtz theory only with unpar- 
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simonious additions to the basic 
trichomatic notion. 

Although Pickford himself con- 
siders the work he has done as more 
preliminary than definitive, there are 
several important contributions 
which this research has made to the 
field of color vision testing. Primary 
among these is the fact that diag- 
nostic categories are clear cut and 
objectively defined. A statistical 
procedure is used for defining “‘de- 
viant” and “color weak" groups as 
falling respectively in the 1-to-2 and 
2-to-3 sigma brackets of the normal 
distribution, and justifies separating 
them from the "anomalous" who de- 
viate discontinuously out as far as 
the 5-to-7 sigma brackets. Prota- 
nopes and deuteranopes are dis- 
tinguished in terms of their much 
greater color-matching variability 
and on a basis of their brightness 
matches. He reports that there are 
few “pure” dichromats (using the 
term in its traditional sense), but 
shows how the "nonpure" dichromats 
can be separated from the anomalous 
trichromats. 

Another important contribution is 
the demonstration that individuals 
who may by some tests be considered 
normal, may at the same time show 
an independent weakness in any one 
of four spectral regions: red, yellow, 
green, and blue. Pickford shows, 
however, that there is likely to be 
some relationship between yellow and 
blue, or red and green sensitivity. 

Of great practical value to those 
who must give color vision tests in 
industry or elsewhere are the pre- 
cautions, described in detail, which 
show how the clinician should ap- 
proach an individual to be tested so 
that the test will not be invalidated. 
The color defective has learned to use 
extra cues and all sorts of ruses toget 
by in his everyday life and frequently 
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is a better examiner of the um 
tester than the tester is of him. 
There will be some readers who wil 
be critical of the general exp 
mental technique used. From 
standpoint of classical psychophy 
ics, it may appear that the data à 
skimpy. Typically, one observat 
by a bracketing procedure was 
only basis for scoring a single co 
matching judgment, and a 
match with each of four colors ga’ 
all the diagnostic information. 
tained for a single observer. Then 
are, however, several factors 
make these data considerably me 
than minimally acceptable. For om 
thing, the colorimetric techniq 
used in the main experiment 
herently precise, and the brac! 
procedure carried out by the € 
menter was carefully designed to 
a measure of variability as well as on 
of central tendency. One thing tor 
member is that the author intende 
to develop a practical testing pri 
cedure so that the whole test could b 
given in a matter of 10 or 15 minute 
Pickford did run a test-retest reli 
bility series on 36 observers W 
gave coefficients ranging from .99 
.98 for the more important a 
green test colors. The author inter 
to extend this elaborate and tim 
consuming preliminary study, an 
has already organized a survey of 
least 20,000 British observers. — - 
Although Pickford’s classification 
categories are objectively define 
they are specified in terms of the arb 
trary scales of his colorimeter. 4t 
unfortunate that he did not 
internationally standardized p: 
physical specifications (C.I.E.) for? 
test colors, and that he did not spe 
ify his data in these generally ¢ 
prehensible units for compare? 
purposes. 
Some criticism could be mac 
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the organization of the book. In 
general, a chronological sequence of 
experiments is followed, but it seems 
cluttered at times with repetition, 
with the inclusion of too much detail 
on the subjective reports of isolated 
individuals, and with introspective 
reports of the author. 

In conclusion it may be said that 
no person who must test the color 
vision of others should omit studying 
this book carefully. As Professor 
Bartlett said in his Foreword to the 
book, ‘There is no other book I know 
in which the general methodology of 
this type of psychophysical investiga- 
tion is dealt with anything like as 
well,” 

Ropert W. BURNHAM 

Eastman Kodak Company, 

Rochester, New York 


Wert, JAMES E., Nett, CHARLES 
O., & AHMANN, J. STANLEY. Sta- 
tistical methods in educational and 
psychological research. New York: 
Appleton-Century-Crofts, 1954. 
Pp. viii--435. $5.00. 


This is a lengthy but chatty intro- 
duction to descriptive and inferential 
statistics which progresses at a peda- 
gogical pace suitable to the median 
student. Many detailed examples are 
well presented with emphasis on un- 
derlying rationale and assumptions. 
Arithmetic and high school algebra 
are the only requisites. The first five 
chapters present the customary pro- 
cedures for collecting and summariz- 
ing data including central tendency, 
percentiles, variability, and correla- 
tion. Three chapters are then de- 
voted to the concepts of sampling and 


statistical inference. The remaining 
11 chapters deal with chi square, 
analysis of variance and covariance, 
linear and curvilinear regression, and 
elaborations of correlation. Content- 
wise the book's chief claim to atten- 
tion in the elementary market-place 
is its more than ample discussion, and 
certain unusual material such as 
double classification analysis of vari- 
ance with di ionate frequen- 
cies and the application of the two- 
class discriminant function. A 
worthy contender for the first two 
or three semesters. 
LEoNARD S. KOGAN 
Institute of Welfare Research, 
Community Service Society of 
New York 


Unperwoop, BENTON J., DUNCAN, 
Cart P., TAYLOR, Janet A., & 
Corton, Jon W. Elementary sla- 
tistics. | New York: Appleton- 
Century-Crofts, 1954. Pp. ix +239. 
$3.25. 

This little book is designed for use 
in the introductory uate 
one-semester course. Coverage is 
classic and includes handling of dis- 
tributions, percentiles, central ten- 
dency, variability, correlation, the 
normal curve, significance of differ- 
ences between means, simple analysis 
of variance, and chi square. It is well 
written and pitched toward problems 
of inference in as are anum- 
ber of other elementary texts. 

LEONARD S. KOGAN 

Institute of Welfare Research, 

Community Service Society of 
New York 
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